System, method, and computer program product for accessing a note database having subnote information for the purpose of manipulating subnotes linked to portions of documents

ABSTRACT

The system synchronizes, displays, and manipulates text and image documents in electronic form for display. The text and image files are synchronized to produce Equivalent Files to create an approximate equivalence relationship between the text and the image files. The graphic user interface of the present system allows a user to selectively view an Equivalent File in a window while simultaneously or alternatively viewing an image file within one or more image windows on the display. The user may also create and manipulate notes and subnotes as annotations to document objects or document portions. The notes and subnotes may be linked to text documents, image documents, text objects, or other non-text objects or documents, such as images, audio clips, etc. The user may create new subnotes associated with selected objects or selected portions, and the user may enter information pertaining to the selected objects in the new subnote. The new subnotes may be created in a particular notes window, or if a note window is not open at the time that the user selects an object or document portion, a new note may be created in response to the selected object. The notes and subnotes may be stored in a note database.

This is a continuation of Ser. No. 08/832,971 filed Apr. 4, 1997, nowU.S. Pat. No. 5,809,318, which is a continuation of Ser. No. 08/423,676filed Apr. 18, 1995, now U.S. Pat. No. 5,623,679, issued Apr. 22, 1997,which is a continuation-in-part of Ser. No. 08/341,129 filed Nov. 18,1994, abandoned, which is a continuation-in-part of Ser. No. 08/155,752filed Nov. 19, 1993, now U.S. Pat. No. 5,623,681 issued Apr. 22, 1997,all of which are herein incorporated by reference in their entireties.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to the fields of publishing, documentediting and manipulation, and displaying documents and images. Moreparticularly, the present invention relates to paginating, extracting,synchronizing, and displaying, a document in electronic form.

2. Art Background

As the development of multimedia computer display systems continues toadvance, more computing power and features are available to computerusers. For example, information which has historically been limited topublished paper documents is now being made available through on-linecomputing services from publishers and information vendors. As anincreasing market share of the data and computing capacity is providedthrough low cost high performance personal computers, some of theon-line information is also being made available in compact disks (CD)and magnetic media formats. Compact disk and magnetic media technologyoffer cost effective mass storage of documents, images and other data,in a format readily accessible for use with personal computers in a homeor office environment. The combination of personal computers, compactdisk technology and multimedia interactive graphic user interfaces,permits the access and display of textual and graphic information bypersonal computer (PC) users in a manner not previously known in theindustry. The type of information potentially available to a PC userincludes professional and technical publications, newspapers, magazines,and other scientific and literary data and images.

However, much of the information which is published through, forexample, government sources, newspapers and magazines is not in machinereadable form, but rather is printed on paper. Because of the amount ofwork and effort required to convert the printed information into amachine readable form, only a small portion of the total publishedinformation is currently available for use by PC users using magneticdisks, CDs and the like. In addition, the information which is inmachine readable form is typically available either as an image of theoriginal document or as a stream of text data. An image of a documenthas the advantage of presenting the information in its original formatas published, including non-text material, such as drawings, equations,symbols, diagrams, etc. The viewer is familiar with the format, and theinformation is easily recognized and understood. However, since adocument image is often stored as a bitmap, the content of the documentcannot be easily searched or manipulated. Alternatively, a text datastream format has the advantage of presenting the information in amanipulable and searchable format. Unfortunately, in many cases, theformat of presentation is not the format in which the information wasoriginally published in print. Thus, the users are often unfamiliar withthe format, inhibiting easy navigation of the document makinginformation difficult to find and use.

One example of the problem of reproducing originally published documentsstored in machine readable form, is the storage and display of UnitedStates patent documents by the United States Government. The UnitedStates Patent Office (herein referred to as the "PTO") provides magnetictapes of issued U.S. patents and other documents, in the form of ascanned in image, and as a separate stream of text data. The magnetictape storing the text data does not include graphical illustrations suchas drawings, charts, textual tables, or much in the way of formattingdata. Thus, the reproduction of a United States patent from PTO TextFiles stored on magnetic tape does not result in the display of a U.S.patent as originally published by the U.S. Government. An example of awell known system for displaying text files provided by the PTO is thatof the LexPat® system provided by Mead Data offered in conjunction withthe Lexis® display system. Using the LexPat® system, the display of aU.S. patent on a terminal, such as a PC, results in a display of textonly, and does not include drawings, charts, graphs, or originalformatting information. The text of a selected patent appears in ASCIIformat, but does not appear as the original patent issued by the PTO,and may not be referenced by the original column and line numbers fromthe published patent. Other systems display text files of periodicalssuch as the Wall Street Journal or legal documents such as contracts.However, the text files do not appear as the original documents.

The U.S. Patent Office also provides magnetic tapes with image filescomprising a scanned in image of the original U.S. patent issued by thePTO and published by the U.S. Government. The image files provided onmagnetic tape by the PTO simply represent a bitmap image of the originalpublished patent. As a scanned in image, the entire patent is providedincluding drawings, charts, graphs, text and the original format, sinceit represents a simple bitmap of the scanned original document. However,a scanned document may not be easily searched, edited, navigated orotherwise manipulated as can a text file.

As will be described, the present invention provides a method andapparatus for extracting, synchronizing, displaying, navigating andmanipulating text and image documents simultaneously in electronic form.The present invention is described with particular reference for usewith U.S. patent documents, and includes the process of extractingpatent text and image data from magnetic tapes provided by the PTO,synchronizing the text and image data for recovering the original format(i.e., columns and lines) of the original published patent, anddisplaying the formatted text along with images using a unique graphicaluser interface (GUI) workbench. Although the present invention isdescribed with reference to patent documents, it will be appreciatedthat the invention has application to a variety of different types ofdocuments and applications.

The present invention's graphical user interface permits a user toselectively view ASCII text documents as well as bitmapped scannedimages simultaneously on a display. When used in conjunction with U.S.patent documents, the graphic user interface of the present inventionallows a user, such as a patent attorney, to display and manipulate bothtextual as well as graphic portions of patents. The text of a patent maybe viewed on the display as it was originally published by the PTO,including column and line numbers. Simultaneously, the user may view thefigures of a patent in the form of an image comprising a bitmap. Variousfunctions are provided by the present invention for viewing,manipulating and displaying the patent documents. In order to assist thereader in understanding of graphic user interface (GUI) technology, itis suggested that certain references be considered for background. Manyuser interfaces utilize metaphors in the design of the interface as away of maximizing human familiarity, and conveying information betweenthe user and the computer. As for the use of familiar metaphors, such asdesktops, notebooks, spread sheets, and the like, the interface takesadvantage of existing human mental structures to permit a user to drawupon the metaphor analogy to understand the requirements of theparticular computer system. (See for example, Patrick Chan "LearningConsiderations in User Interface Design: The Room Model", ReportCS-84-16, University of Waterloo, Computer Science Department, Ontario,Canada, July, 1984 and the references cited therein.) In addition, thereader is referred to the following references which describe variousaspects, methods and apparatus associated with prior art graphic userinterface design: U.S. Pat. No. Re.32,632; U.S. Pat. No. 4,931,783; U.S.Pat. No. 5,072,412; and U.S. Pat. No. 5,148,154, and the referencescited therein.

As will be described more fully below, the present invention's graphicuser interface is based on a desktop "windows" metaphor, and providesthe user with the ability to simultaneously display text and imagedocuments in both a synchronized and unsynchronized fashion, as will bemore fully described herein.

SUMMARY OF THE INVENTION

The present invention provides a method and apparatus for extracting,synchronizing, displaying, and manipulating text and image documents inmachine readable form for display. In the preferred embodiment of thepresent invention, text and image files for documents, such as forexample patent documents, are initially stored on separate magnetic tapemedia. These data files are extracted from the respective tapes andplaced onto a faster medium, such as a hard disk drive. Catalogs aregenerated of the contents of the tapes and procedures are provided forlocating and loading tapes from a tape inventory. The text and imagefiles are synchronized to produce Equivalent Files using heuristicalgorithms to create an approximate equivalence relationship between thetext and the image files. In the presently preferred embodiment, theautomatic pagination of the text and image files provides an equivalencerelationship, and a final Equivalent File is obtained through humanintervention to correct any inaccuracies still remaining after theautomatic process has been completed. However, the present inventionalso contemplates an entirely automatic pagination process which wouldrequire no human intervention to obtain a usable Equivalent File. A wordbased inverted tree index is created for the text files to allow forvery fast text searching using a graphic user interface (GUI) workbench.

The Equivalent Files and image files residing on, for example, a harddisk drive or compact disk (CD), are coupled as a resource to a computerdisplay system. The computer display system includes a computer having acentral processing unit (CPU) coupled to memory and input/output (I/O)circuitry. The computer is also coupled to a CD ROM, hard disk drive, orother mass memory device onto which the Equivalent File and image filehave been stored. The computer is coupled to a display, such as acathode ray tube (CRT) or liquid crystal display, as well as a keyboardand a cursor control device. The graphic user interface of the presentinvention is displayed by the computer on the CRT, and includes a menubar and a tool bar, each bar having a plurality of command options forselection by a user. The graphical user interface of the presentinvention permits the user to display, manipulate, and navigate theEquivalent File created using the process of the present invention, andto simultaneously view the image file on the display. In accordance withthe teachings of the present invention, the Equivalent File may besynchronized with the image file, or alteratively, an Equivalent Filemay be displayed along with a completely separate and distinct image(for example, viewing the Equivalent File of one patent while viewingthe image file of another patent). Once created, and as shown on thedisplay, the Equivalent File is displayed in substantially the samecolumn and line format as a printed patent published by the U.S.Government.

Using the graphic user interface of the present invention, a user maycreate libraries of patent text Equivalent Files and image files, aswell as open cases to include a plurality of different patents or otherdocuments. The Equivalent File may be selectively viewed on the displayin an equivalent window. The Equivalent File may be navigated,highlighted, searched, and otherwise annotated using highlights, patentand case notes. Simultaneous with the viewing of the Equivalent File ofa patent within the equivalent window, the user may view the exactportion of the image file corresponding to the display of the EquivalentFile, or any portion of an image file within one or more image windowson the display. The present invention further provides search mechanismsfor defining and searching key words chosen by the user or selected fromthe Equivalent File, or a word list. Boolean and proximity searches mayalso be performed on the Equivalent File and the results displayed. Thesearch terms may be used to search documents within the equivalentwindow of a current Equivalent File, current library of documents,documents notes (referred to herein as "patent notes" and/or "casenotes"), as well as other selected cases. The word list includes analphabetical list of all words within the selected library, document orthe like. The present invention also permits the user to display animage, for example a patent drawing image, within the image window byplacing a cursor in the text of a patent Equivalent File and signalingthe computer. In response to this signal, the computer displays the lastreferenced figure drawing within the image window. The interface of thepresent invention also permits the user to select portions of textand/or drawings within the image window, and enlarge or reduce theselected image for viewing by the user. The interface further permitsthe user to select any element number appearing on the patent drawingsin the image window. The selection of an element number in a patentdrawing results in the automatic highlighting of the first and everysubsequent occurrence of that element number in the Equivalent Filecomprising a specification and claims of the selected patent equivalentdisplayed in the equivalent window. Additionally, multiple patents,drawings and/or other documents may be viewed simultaneously on thedisplay in accordance with the teachings of the graphic user interfacecomprising the present invention. A variety of other features andfunctions are provided by the present invention for the manipulation,navigation and display of patent documents on the user interface. Theuser may display either a synchronized Image File wherein the imagedisplayed is synchronized with the Equivalent file displayed, or anunsynchronized Image File wherein the image displayed is at some pageother than the one containing the column of text in the Equivalent File.A user may also copy and paste a portion of, or the whole, EquivalentFile to notes of third party programs, such as word processors ordrawing programs as well as allowing the user to import ASCII text intothe notes from third party systems, such as deposition testimony inASCII format into patent notes that relate to the topic of thetestimony. Particularly when using the present invention with patents,it may be used to facilitate patent searching in the preparation andprosecution of patents, licensing of patents, litigation of patents,conducting infringement and validity studies of patents, producinginfringement claim charts, managing and valuing a portfolio or group ofpatents, conducting 35 U.S.C. § 112 searches on patents or pendingapplications, and many other uses which are regularly performed by apatent attorney, patent agent or technical personnel.

NOTATION AND NOMENCLATURE

In some of the detailed descriptions which follow, the present inventionis presented partly in terms of interface display images, process steps,and symbolic representations of operations of data bits within acomputer memory. These algorithmic descriptions and representations arethe means used by those skilled in the data processing arts to mosteffectively convey the substance of their work to others skilled in theart.

An algorithm is here, and generally, conceived to be a self-consistentsequence of steps leading to a desired result. These steps are thoserequiring physical manipulations of physical quantities. Usually, thoughnot necessarily, these quantities may take the form of electrical ormagnetic signals capable of being stored, transferred, combined,compared, displayed and otherwise manipulated. It proves convenient attimes, principally for reasons of common usage, to refer to thesesignals as bits, values, elements, symbols, characters, images, terms,numbers, or the like. It should be borne in mind, however, that all ofthese similar terms are to be associated with the appropriate physicalquantities and are merely convenient labels applied to these quantities.

In the present invention, the operations referred to are machineoperations performed in conjunction with a human operator. Usefulmachines for performing the operations of the present invention includegeneral purpose digital computers, digitally controlled displays orother similar devices. In all cases, the reader is advised to keep inmind the distinction between the method of operating a computer and/ordisplay system, and the method of computation itself. The presentinvention relates to methods for operating a computer and interactivedisplay system, and processing electrical or other physical signals togenerate other desired physical signals.

The present invention also relates to apparatus for performing theseoperations. This apparatus may be specially constructed for the requiredpurposes or it may comprise a general purpose computer selectivelyactivated or reconfigured by a computer program stored in the computer.The method steps presented herein are not inherently related to anyparticular computer or other apparatus. Various general purpose machinesmay be used with programs in accordance with the teachings herein, or itmay prove more convenient to construct specialized apparatus to performthe required method steps. As such, no particular programming languageis provided, as any one of a variety of languages may be utilized toimplement the invention. The required structure for a variety of thesemachines and programming environments will be apparent from thedescription given below.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of production configuration to extract textand image files, paginate the text files with the image files to produceEquivalent Files, and index the Equivalent Files.

FIG. 2 is a flow chart illustrating the sequence of steps utilized bythe present invention to extract text and image files, paginate the textfiles with the image files to produce Equivalent Files, index theEquivalent Files and display the Equivalent Files and/or Image Files ona display.

FIG. 3 is a functional block diagram illustrating a computer displaysystem incorporating the teachings of the present invention.

FIG. 4 illustrates an enlarged portion of an image file comprising thebibliography page of U.S. Pat. No. 5,165,027.

FIG. 5 illustrates a sample portion of a PTO Text File for U.S. Pat. No.5,165,027 illustrated in FIG. 4.

FIG. 6 illustrates an example of the column information listed in thePTO Text File for the U.S. Pat. No. 5,165,027 illustrated in FIGS. 4 and5.

FIG. 7 illustrates the paragraph shown in FIG. 6 as it is stored in thePTO Image File for U.S. Pat. No. 5,165,027.

FIG. 8 illustrates the column line number information provided by apublished United States patent.

FIG. 9 illustrates a flow chart block diagram of the extraction processutilized by the present invention to extract PTO Text Files and PTOImage Files for magnetic tapes provided by the PTO for use by theprocessing system of the present invention to synchronize and index thetext and image files.

FIG. 10 is a flow chart illustrating the pagination process of thepresent invention to synchronize the PTO Text File and the PTO ImageFile to produce an Equivalent File.

FIG. 11 illustrates the user interface of the present invention uponsystem start including the title, menu and tool bars.

FIG. 12 illustrates the selection by a user of a down arrow function toopen a list of available cases.

FIG. 13 illustrates the present invention's use of information arrows todirect the user to currently available options for execution.

FIG. 14 illustrates the patent text toolbox of the present invention andthe display of a menu of patent section headings to assist the user innavigating a selected patent.

FIG. 15 illustrates the sub-command items available for selection by auser upon activating the Library menu option.

FIG. 16 illustrates the Set Library Directories dialog box, displayedafter selection of the Set Library Directories sub-command item on theLibrary menu.

FIG. 17 illustrates the New Library dialog box.

FIG. 18 illustrates the Open Library dialog box.

FIG. 19 illustrates the present invention's Library dialog box forworking with the library currently in use.

FIG. 20 illustrates the selection of a patent within the Intel* Library.

FIG. 21 illustrates the present invention's minimization of a library toan icon.

FIG. 22 illustrates the present invention's Update Library dialog boxfor updating the library currently in use, which in the present example,the Intel® Library.

FIG. 23 illustrates the present invention's Search Library dialog boxwhich is displayed upon selection of the Search sub-command item fromthe library menu.

FIG. 24 illustrates the present invention's Word List dialog box whichis displayed upon the activation of the Word List button function withinthe Search library dialog box.

FIG. 25 illustrates the operation of the present invention's Word Listdialog box for selecting an alphabetical tab and viewing thecorresponding list of words from the library patents.

FIG. 26 illustrates the present invention's Search Results dialog boxidentifying the number of occurrences of the search term defined by theuser in each of the library patents.

FIG. 27 illustrates the present invention's Library to Case CrossReference dialog box.

FIG. 28 illustrates the present invention's Patent Text Toolbox foroperating upon Equivalent Files displayed in an equivalent window.

FIG. 29 further illustrates the present invention's Patent Text Toolboxfor operating upon the Equivalent File within the equivalent window.

FIG. 30 illustrates the present invention's simultaneous display of anequivalent window and an image window, as well as the display of aPatent Image Toolbox for operating upon images displayed within theimage window.

FIG. 31 illustrates the present invention's simultaneous andsynchronized display of an Equivalent File in an equivalent window andenlarged image displayed in an image window on the display screen.

FIG. 32 illustrates the display of patent section headings and theability of a user to navigate the patent sections displayed within theequivalent window through the selection of section headings.

FIG. 33 illustrates the present invention's synchronization of anEquivalent File displayed in the equivalent window with the drawings ofa patent disposed in an image file displayed in an image window on thedisplay screen. The present invention links references to the figurenumbers in the Equivalent File to the figures in the image filedisplayed in the image window.

FIG. 34 illustrates the present invention's use of an outline box toidentify an area of the patent image to be enlarged.

FIG. 35 illustrates the present invention's user interface in which anEquivalent File is displayed in an equivalent window, andsimultaneously, an enlarged portion of a figure from the image file isdisplayed in the image window on the display screen.

FIG. 36 illustrates the present invention's Select Element Number dialogbox, which permits a user to input a drawing element and locate thefirst occurrence and the subsequent occurrences of the drawing elementin the Equivalent File displayed in the equivalent window.

FIG. 37 illustrates the present invention's use of highlighting tohighlight desired portions of the Equivalent File in various colors.

FIG. 38 illustrates the present invention's display of two equivalentwindows and one image window on the display screen.

FIG. 39 illustrates the Import Patents dialog box of the presentinvention.

FIG. 40 illustrates the Import Patents dialog box after the selection ofan Equivalent File to be imported.

FIG. 41 illustrates sub-command items available for selection upon theactivation of the Case menu option.

FIG. 42 illustrates the Open Case dialog box which is displayed once theOpen Case sub-command item illustrated in FIG. 41 is selected.

FIG. 43 illustrates the New Case dialog box which is displayed upon theselection of the New Case sub-command item illustrated in FIG. 41.

FIG. 44 illustrates the patent number drop down menu which permits auser to select a patent within a case for displaying.

FIG. 45 illustrates the Update Case dialog box which is displayed uponthe activation of the Update Case sub-command item illustrated in FIG.41.

FIG. 46 illustrates the search case dialog box which is displayed uponthe selection of the Search sub-command item of the Case menuillustrated in FIG. 41.

FIG. 47 illustrates the Set Case Directories dialog box which isdisplayed upon the activation of the Set Case Directories sub-commanditem illustrated in FIG. 41.

FIG. 48 illustrates the Copy to Case dialog box which is displayed uponthe selection of the Copy Case sub-command item illustrated in FIG. 41.

FIG. 49 illustrates the Backup Case dialog box which is displayed uponthe activation of the Backup Case sub-command item of FIG. 41.

FIG. 50 illustrates the Delete dialog box which is displayed upon theselection of the Delete Case sub-command item illustrated in FIG. 41.

FIG. 51 illustrates the Print dialog box of the present invention whichis displayed upon the activation of the Print sub-command itemillustrated in FIG. 41.

FIG. 52 illustrates the Print Setup dialog box which is displayed uponthe activation of the Print Setup sub-command item illustrated in FIG.41.

FIG. 53 illustrates the sub-command items available for selection uponthe activation of the Edit command option.

FIG. 54 illustrates the sub-command items available for selection by auser upon the activation of the View command option.

FIG. 55 illustrates the Preferences dialog box displayed upon theactivation of the Preferences sub-command item of FIG. 54.

FIG. 56 illustrates the Screen Layout dialog box which is displayed uponthe selection of a Screen Layout sub-command item of FIG. 54.

FIG. 57 illustrates the user interface of the present invention upon theselection of the Screen Layout of the Screen Layout dialog boxillustrating one equivalent window and one image window on the displayscreen.

FIG. 58 illustrates the user interface of the present invention in whichtwo equivalent windows are displayed side by side on the display screenafter selection of Screen Layout of the Screen Layout dialog box.

FIG. 59 illustrates the graphic user interface of the present inventionin which two equivalent windows and two image windows are displayed onthe display screen subsequent to the selection of Screen Layout of theScreen Layout dialog box.

FIG. 60 illustrates the sub-command items available for selection uponthe activation of the Window command option.

FIG. 61 illustrates the patent note menu of the present invention whichdisplays all patent notes which have been generated by a user.

FIG. 62 illustrates a patent note of the present invention.

FIG. 63 illustrates the present invention's use of multi-notes whereinmultiple patent notes may be created within a single patent note.

FIG. 64 illustrates the present invention's case note.

FIG. 65 illustrates the minimization of exemplary documents, such assearch results and the like on the display of the present invention.

FIG. 66 illustrates the present invention's Go To Section dialog boxwhich permits a user to input a patent column number and uponactivation, results in the display of the column in the Equivalent Filecorresponding to the desired patent column.

FIG. 67 illustrates the present invention's Go To section dialog boxwhich permits a user to select a section of the patent and uponactivation, results in the display of the selected section in theEquivalent window.

FIG. 68 illustrates the sub-command items available for selection by auser upon the activation of the Help command option.

FIG. 69 illustrates the About dialog box which is displayed upon theactivation of the About sub-command item illustrated in FIG. 68.

FIG. 70 illustrates the sub-command items which are available forselection by a user upon the activation of the Note command option.

FIG. 71 illustrates the case notes in Case dialog box which is displayedupon the selection of the View Case Note sub-command option illustratedin FIG. 70.

FIG. 72 illustrates the patent notes in Case dialog box which isdisplayed upon the selection of the View Patent Note sub-command itemillustrated in FIG. 70.

FIG. 73 is a simplified block diagram of a computer system according toa preferred embodiment of the present invention.

FIG. 74 is a flowchart depicting the preferred manner in which datatransfer operations occur between machines in the computer system ofFIG. 73.

FIGS. 75, 76A, and 76B are used to describe the manner in which PTOImage files are compressed according to a preferred embodiment of thepresent invention.

FIGS. 77 and 78 are flowcharts depicting the manner in which paginationis performed according to a preferred embodiment of the presentinvention.

FIGS. 79 and 80 are used to describe a "Copy Claims" option preferablyprovided by the user interface of the present invention.

FIGS. 81 and 82 are used to describe a "Zoom Image" option preferablyprovided by the user interface of the present invention.

FIG. 83 is used to describe a "Copy Image" option preferably provided bythe user interface of the present invention.

FIG. 84 is used to describe a "Lock Windows" option preferably providedby the user interface of the present invention.

FIGS. 85A and 85B are used to illustrate the preferred manner in whichthe present invention performs clumping.

FIG. 86 is used to illustrate the preferred manner in which the presentinvention performs character stream matching.

FIGS. 87 and 88 illustrate note databases according to first and secondembodiments, respectively, of the present invention.

FIG. 89 illustrates an example note window according to the secondembodiment shown in FIG. 88.

FIG. 90 illustrates an example display showing a text window, an imagewindow, and a notes window.

DETAILED DESCRIPTION OF THE INVENTION

In the following description, numerous specific details are set forthsuch as functional blocks, representative data processing devices,window configurations, specific patent documents, text and drawings,etc., to provide a thorough understanding of the present invention.However, it will be apparent to one skilled in the art that the presentinvention may be practiced without these specific details. In otherinstances, well known circuits and structures are not described indetail in order not to obscure the present invention unnecessarily.

The present invention will be described in various sections including adiscussion of the general system configuration, the tape extractionprocess, the pagination process, the indexing process, and the graphicuser interface. It is to be understood that although the followingdescription is directed to U.S. patent documents, the present inventionis not limited to patents, and has application to a variety of documentsand images, as may be required by a particular application, such as forexample, legal contracts, the Wall Street Journal, The Los AngelesTimes, etc.

General Overview of the Invention

The general system configuration of the present invention discloses onepossible implementation of the present invention for the display,navigation, manipulation and editing of text and image data in agraphical user interface. As will be described, the general systemconfiguration describes a computer display system which may be in theform of a personal computer, workstation, or dedicated processor systemto permit the user to utilize the teachings of the present invention. Noparticular computer hardware is described within this specification, andthe general system configuration description is intended to encompass abroad range of possible data processing systems in which the presentinvention may be implemented.

A general overview of the system of the present invention is shown inFIG. 1, and a flow chart of the primary process steps comprising themethod of the present invention is illustrated in FIG. 2.

The tape extraction process of the present invention extracts data filesfrom PTO text and PTO Image File magnetic tapes provided by the PTO. Thedata files are extracted from these tapes onto a faster medium (such asa hard disk drive) to provide access times which are useful in modemdata processing systems. As will be described, the process of extractioninvolves appropriately generating catalogues and inventories of thecontents of the tapes, as well as procedures for selecting and loadingtapes from the newly created tape inventories.

The process of paginating the PTO Text Files and the PTO Image Files toproduce "Equivalent Files" is performed by using a heuristic set ofalgorithms to automatically create an approximate equivalentrelationship between the text and image files. A human operator verifiesthe results to finalize the Equivalent File, such that the originalformatting of the published patent document is reflected in theEquivalent File.

As will be described, a process for creating an inverted tree index forthe text contained in the PTO Text Files is disclosed. This indexingprocess results in a pre-built index for very fast text searching whenusing the graphic user interface of the present invention. Although thepresent invention describes an inverted tape index, other types of textsearching methods may be employed, instead of the inverted tape index.

The graphic user interface ("GUI") of the present invention displays theEquivalent File and the PTO Image File, and allows the user to performanalysis on the displayed files or other stored files. The EquivalentFile is formatted and displayed with a similar appearance to the PTOImage File, having the same column and line formatting as the publishedpatent. The user may then, for example, use the GUI to perform textsearches to generate accurate column and line citations, navigate theEquivalent File via section headings to locate desired sections of text,as well as to view the figures or text images in the displayed files orother stored files. Images and equivalent patent text may be viewedeither in a synchronized or unsynchronized fashion using the teachingsof the present invention.

General System Configuration

FIG. 1 illustrates a block diagram of the present invention's productionconfiguration to extract text and image files, to paginate the textfiles to produce Equivalent Files, and to index the Equivalent Files.The process begins with the PTO magnetic tapes 1 that are of type 3480from the PTO. There are three different categories of PTO magnetictapes: PTO text tapes, PTO image tapes and PTO assignment tapes. A UNIXmachine 2 reads the data in the PTO tapes 1 into a large file buffer.The data is then parsed to find each of the documents that are on thetapes. Parsing creates a table which contains patent numbers, thephysical locations of the patent files on the tapes, the total number ofbytes and other control information about each document that appears onthe tape. A document can be either a patent, a certificate ofcorrection, a reissued patent disclaimer or any other post-issuancedocument. The data can then be either stored in a digital linear tape(DLT) 3 or in any other suitable data storage medium. Because the amountof disk storage space required for the total active set of patents isgreater than 1 terabyte (TB), currently the data is stored intolibraries S. The libraries may contain PTO Text Files 6, PTO Image Files7 and post issuance documents 9. If a disk drive system with a largeenough storage is available, the data can be stored in a disk drive. Atpresent, the PTO image tapes are left in their original medium, namelythe 3480 magnetic tapes.

Continuing to refer to FIG. 1, when an order 10 requesting a list ofpatents is entered into a UNIX database 11, the UNIX database 11 sortsthe request list by patent location to minimize the number of differenttapes that need to be mounted, and sends to the staging machine 8 thelist of patents and other pertinent information such as the volumeserial number of the tapes, and location information that allows thestaging machine 8 to fast forward to the individual patent files thatare requested. The staging machine 8 creates a file on its disks of allthe text and image portions of each patent that has been requested toprocess. When the staging machine 8 has the text and image filesavailable, it sends the text and image files to the pagination machine13.

Further referring to FIG. 1, at present, the pagination machine 13utilizes one or more DOS based machines 16 to paginate the text andimage files and to create Equivalent Files as described more fully inthe Terminology and Definition section in this Specification. Afterpagination, an index machine 19 adds post issuance documents 9 andindexes the Equivalent Files. The index machine 19 incorporates one ormore DOS based machines 20. Next, the manufacturing machine 23 creates aCD ROM image of the Equivalent Files and the Image Files and writes theimage to a CD ROM and digital linear tapes 28. The manufacturing machine23 may utilize one or more DOS based machines 27, a CD ROM writer 25 anddigital linear tapes 28. The CD ROM with the Equivalent Files and theImage Files are delivered to a user who then uses a system, such as theone illustrated in FIG. 3, to display and manipulate the files. Thedigital linear tapes with the finished patents are stored in a library30, and the database 11 is updated so that when a particular patent inthe library 30 is requested, the staging machine 8 mounts the finishedpatent from the library 30, and the database flags that the patent hasalready been paginated and indexed, so that pagination and indexingsteps can be skipped for a faster process. Although in the presentinvention, specific machines such as UNIX machines and DOS machines aredisclosed, these are mere examples of different types of computersystems that can be incorporated and not limitations upon the presentinvention.

As evident from the above description, the present invention involves asignificant amount of transfer of data between machines, such as betweenthe extraction machine, the libraries 5, the staging machine 8, and thepagination machine 13 (in practice, these machines may be implementedusing a single computer platform, or multiple computer platforms). Themanner in which such data transfer takes place according to a preferredembodiment of the present invention shall now be described withreference to FIGS. 73 and 74.

FIG. 73 is a simplified representation (in block diagram form) of thesystem configuration shown in FIG. 1. FIG. 73 shows a computer system7302 that includes a first client machine 7304, a second client machine7306, a shared disk drive 7310, and a tape drive 7308. The shared diskdrive 7310 is preferably part of the second client machine 7306, and thesecond client machine 7306 is preferably a UNIX-based machine.Preferably, the shared disk drive 7310 can be directly accessed by boththe second client 7306 and the first client 7304.

As will be appreciated, in UNIX-based systems, file rename operationsare atomic operations. Thus, the shared disk drive 7310 cannot performany other file-related operations when it is performing a file renameoperation (this is the case, since the shared disk drive 7310 is part ofthe UNIX-based second client machine 7306).

There are instances when the first client 7304 will want to access datain the tape drive 7308 via the second client 7306. Consider the casewhere the first client 7304 represents the pagination machine 13, thesecond client 7306 represents the staging machine 8, and the tape drive7308 represents the library 5. Often, the pagination machine 13 willwant to access data in the library 5. To do so, the pagination machine13 will have to interact with the staging machine 8. Preferably, suchinteraction between the pagination machine 13 and the staging machine 8is achieved by using the shared disk drive 7310. In particular, thepagination machine 13 (i.e., the first client 7304) writes a "read"command on the shared disk drive 7310. The staging machine 8 (the secondclient 7306) retrieves the "read" command from the shared disk drive7310 and then performs the "read" command, wherein such performance ofthe "read" command results in data being read from the library 5 (tapedrive 7308) and transferred to the pagination machine 13. Other datatransfer scenarios in the system configuration shown in FIG. 1 will beapparent to persons skilled in the relevant art.

As will be appreciated, handshaking must be implemented between thefirst client 7304 and the second client 7306 to ensure that the secondclient 7306 does not read the "read" command from the shared disk drive7310 before the first client 7304 has finished writing the "read"command to the shared disk drive 7310. Otherwise, improper operationwill result.

FIG. 74 is a flowchart 7402 representing the operation of the firstclient 7304 and the second client 7306 during data transfer operations.Such operation of the present invention achieves handshaking during datatransfer, operations without requiring any explicit communicationbetween the first and second clients 7304, 7306. This helps in reducingthe load on system resources (such as communication bandwidth), therebyoptimizing system performance. Flowchart 7402 begins with step 7404,where control immediately passes to step 7406.

In step 7406, the first client 7304 begins writing a read command file(which contains commands that instructs the second client 7306 to readdata from the tape drive 7308) to the shared disk drive 7310. The readcommand file is named "DLT.CXX". The second client 7306 periodicallyscans through the shared disk drive 7310 and retrieves and executesfiles with a ".CMD" extension.

Step 7408 is performed after the first client 7304 has completelywritten the file "DLT.CXX" to the shared disk drive 7310. In step 7408,the first client 7304 changes the name of the "DLT.CXX" file to"DLT.CMD". As discussed above, file rename operations are atomicoperations. Thus, the second client 7306 is not able to read the"DLT.CMD" file from the shared disk drive 7310 until the renameoperation is complete (and the rename operation is not initiated untilthe read command file has been completely written to the shared diskdrive 7310).

In step 7410, after the rename operation is complete, the second client7306 discovers that a file with a ".CMD" extension is located in theshared disk drive 7310 (i.e., the "DLT.CMD" file). The second client7306 retrieves the "DLT.CMD" file from the shared disk drive 7310, andexecutes it. Operation of flowchart 7402 is complete after step 7410 isperformed, as indicated by step 7412.

Referring now to FIG. 3, an exemplary computer display system for use inaccordance with the teachings of the present invention is shown. Thecomputer system includes a display 40, such as a CRT monitor or a liquidcrystal display (LCD), and further includes a cursor control device 42,such as a mouse of the type shown in U.S. Pat. No. Re.32,632, a trackball, joy stick, keyboard or other device for selectively positioning acursor 44 on a display screen 68 of the display 40. Typically, thecursor control device 42 includes a signal generation means, such as aswitch 46 having a first position and a second position. For example,the mouse shown and described in U.S. Pat. No. Re.32,632 includes aswitch which the user of the computer system uses to generate signalsdirecting the computer to execute certain commands. As illustrated, thecursor control means 42 (hereinafter all types of applicable cursorcontrol devices, such as mice, track balls, joy sticks, graphic tablets,keyboard inputs, and the like, are at times collectively referred to asthe "mouse 42") is coupled to a computer 48.

The computer 48 comprises three major components. The first of these isan input/output (I/0) circuit 50 which is used to communicateinformation in appropriately structured form to and from other portionsof the computer 48. In addition, the computer 48 includes a centralprocessing unit (CPU) 52 coupled to the I/O circuit 50 and a memory 55.These elements are those typically found in most general purposecomputers, and in fact, computer 48 is intended to be representative ofa broad category of data processing devices capable of generatinggraphic displays.

Also shown in FIG. 3 is a keyboard 56 to input data and commands intothe computer 48, as is well known in the art. A mass memory disk 60 isshown coupled to I/O circuit 50 to provide additional storage capabilityfor the computer 48. In addition, a CD ROM 62 and a floppy disk 64 isfurther coupled to the I/O circuit 50, for providing, as will bedescribed, a library of textual documents and images to be displayed onthe display 40. It will be appreciated that additional devices may becoupled to the computer 48 for storing data, such as magnetic tapedrives, as well as networks, which are in turn coupled to other dataprocessing systems. A printer 57 is coupled to the I/O circuit 50 forprinting documents, images, and the like, as is well known.

In one embodiment, the present invention is a computer program product(such as a floppy disk, compact disk, etc.) comprising a computerreadable media having control logic recorded thereon. The control logic,when loaded into memory 55 and executed by the CPU 52, enables the CPU52 to perform the operations described herein. Accordingly, such controllogic represents a controller, since it controls the CPU 52 duringexecution.

As illustrated in FIG. 3, the display 40 includes the display screen 68in which a window 70 is displayed. The window 70 may be in the form arectangle or other well known shape, and may include a menu bar 72disposed horizontally across the length of the window, or in any otherdesired position on the window. As is well known, the movement of themouse 42 may be translated by the computer 48 into movement of thecursor 44 on the display screen 70. The reader is referred to literaturecited in the background describing object-oriented display systemsgenerally, and in particular, desktop metaphor window-based systems foradditional description related to other computer systems which may beutilized in accordance with the teachings of the present invention. Thesystem illustrated in FIG. 3 is intended to represent a general computerdisplay system capable of providing a graphic user interface display.

In this specification, the present invention is described with referenceto the display, navigation, and manipulation of United States patentdocuments. In particular, the invention is described herein as providinga unique method and apparatus for extracting, paginating, displaying,manipulating, navigating and editing the text of issued United Statespatents, and simultaneously displaying an image of a patent includingthe drawings on the display 40. Although the description hereindescribes the invention with reference to patent documents, as haspreviously been mentioned, it will be appreciated by one skilled in theart that the present invention may be used in a variety of applicationswhich require the simultaneous display, synchronization of, orunsynchronized display of, text and images on a display. For purposes ofthis specification, all references to "patents" or documents generally,shall be understood to encompass documents of every type, and are notlimited solely to patent documents.

In addition, it will be noted that no particular programming languagehas been disclosed to implement the present invention using the computerdisplay system illustrated in FIG. 3. A variety of programming languagessuch as C, C++, Visual Basic, etc. may be used to implement the presentinvention on many different computer display platforms, using theteachings described herein.

Terminology and Definitions

A "PTO Image File" is an electronically stored data file in the formatspecified in the document: "U.S. Patent and Trademark Office APS U.S.Patent Image Data File". Each of these files contains one or more imagepages from a patent document. Each image page in a PTO Image File is anelectronic representation of an actual page of a patent or a relatedpatent document (such as a Certificate of Correction). The image pagesare created by the U.S. Patent and Trademark Office by the use of anelectronic scanner, and are stored in the PTO Image File in Group 4compressed format (see Federal Information Processing Standardspublication 150: "Facsimile Coding Schemes and Coding Control FunctionsFor Group 4 Facsimile Apparatus"). An enlarged portion of an exemplaryimage page (the bibliography page of U.S. Pat. No. 5,165,027) is shownin FIG. 4.

A "PTO Text File" is an electronically stored data file in the formatspecified in the document: "U.S. Patent and Trademark Office PatentFullText/APS File". Each of these files contains an ASCII textrepresentation of most of the textual data in a patent document.Generally, the bibliography information and the text paragraphs in themain body of the patent will be found in this file. Some equations andtables of textual information that appear in a patent will also bestored in this type of file. Visual information, such as diagrams andtables containing information of a graphical nature, and formattinginformation will not be found in the PTO Text File. In addition, thecolumn and line number information that appears on published patents isnot stored in the PTO Text File nor is the format of the bibliographicalpage.

The ASCII data in a PTO Text File is stored in fixed-length eightycharacter records. The first four characters of each record are an IDcode that identifies what type of data the record contains, the fifthcharacter is a blank, and the last seventy-five characters of the recordstore the actual data values. If the first four characters are allblanks, then the record is a continuation of the previous record.

For example, in the PTO Text File for U.S. Pat. No. 5,165,027 (part ofwhich is illustrated in FIG. 5), there is a record that begins with"TTL" followed by "Microprocessor breakpoint apparatus". This "TTL"record stores the title of the patent which is "Microprocessorbreakpoint apparatus". One of these "TTL" records is required in everypatent.

Another record, which begins with "ISD" and contains "19921117", showsthe issue date of the patent which is Nov. 17, 1992.

In both of the above examples, the amount of data to be stored isseventy-five characters or less and therefore fits in one record. Inmany instances, there is too much data to fit in one record. Paragraphsof text from the main body of the patent are often split into multiplerecords because they have more than seventy-five characters. The firstrecord of such a paragraph would start with an identifier ("ID") such as"PAR" (which indicates a paragraph whose first line is indented). Thesubsequent records used to hold the paragraph would start with an ID offour blanks indicating that these records are continuations of the firstrecord. As many words as will fit in the seventy-five characters(without breaking the words) are stored in ear-h record (see FIG. 5).

The PTO Text File stores data relating to a patent in an informationalformat using ASCII text rather than a visual display format (see FIG.5). The Text File is comprised of records that contain labeled pieces ofinformation. This is a very convenient format for processing theinformation about a patent using a computer (such as performing a textsearch or navigating the text of the patent).

The PTO Image File stores data relating to a patent in a scanned bitmapdisplay format that is very easy for a human being to work with (seeFIG. 4) since it visually appears as the original published patent. ThePTO Image File comprises a series of digitized page images that arecreated by using a page scanning device to capture black and whitepictures of typeset patent pages. This is a very convenient format forallowing a human to view the information contained in a patent. Forexample, the image pages can be printed on a laser printer to produce areadable paper document that visually displays the diagrams, equationsand figures of the patent, as it was published by the U.S. Government.

An "Equivalent File" is an electronically stored data file whichcontains pagination information that details the equivalencerelationship between a PTO Text file and a PTO Image File. Thisrelationship makes both the PTO Text file and the PTO Image File moreuseful by specifying how the record-based ASCII data of the PTO Textfile can be manipulated to be substantially equivalent in appearance tothe PTO Image File and yet still retain its useful properties as anASCII file.

"Pagination" is a process by which an Equivalent File is created from aPTO Text File and a PTO Image File. The PTO Image File is read todetermine the locations of column breaks, column number, line breaks andline numbers as well as the locations and sizes of imbedded tables,structures, equations, and other non-text information in thespecification. Pattern recognition techniques familiar to those skilledin the art are used to block and segment the layout of the image pages.

The PTO Text File is read to determine bibliographic information, figurereferences, section headings, font style, point size, superscript,subscript, boldness or presence of italicized type, and specialcharacters.

The results of these two operations are then combined either manually,or by the use of Optical Character Recognition techniques to produce theequivalent File. Each of the PTO Text File paragraphs that begins with abibliographic information ID code is formatted to approximate theappearance of the Bibliography section on a typeset PTO BibliographyImage Page. Likewise each of the text paragraphs from the PTO Text Filein the Specification and Claims sections is processed to produce a textfile formatted to approximate the appearance of the Specification orClaims section in typeset PTO Specification and Claims image page(s).

The requirement for pagination of the PTO Text File and the PTO ImageFile arises from several distinct requirements in the field of use. Inciting a patent in, for example, a legal proceeding, the specificreference is made by the column number and line number of the portion ofinterest. These column and line numbers are printed in the publishedpatent and appear in the format of the page represented by the PTO ImageFile. However, these column and line numbers do not appear in the PTOText File, making it difficult to discern a proper citation from the PTOText File. In use, a user may perform a word search on the PTO Text Fileto locate a specific term. Once located in the PTO Text File, should theuser wish to cite that reference, he or she must refer back to the PTOImage File (or the actual paper patent) to locate the exact columnnumber and line number, without the benefit of any information as tothat location.

Another requirement for pagination arises from the practice of placingpure images in line with the text in the columns of the patent. Forexample, a diagram of a structure followed by the text description ofthat structure, in the PTO Text File would appear only as text, withoutthe image of the structure. The user must refer back to the PTO ImageFile (or the paper patent) to locate and study the diagram of thestructure, again without any information regarding the physical locationof the illustration, diagram, figures or the like, from the data in thePTO Text File.

The specific information about how the typesetting equipment processesthe data from the PTO Text File to produce the PTO Image File is notavailable from the U.S. Government. Therefore, the two files mustnormally be treated as completely separate entities. (The PTO itselfuses the files separately on two computers manufactured by SunMicrosystems, Inc.) The PTO Text File is normally used to search fortext but has no information as to where or how the information appearsin the typeset patent image pages. The PTO Image File is used to viewthe typeset text, diagrams, figures, and equations but has norepresentation of the data stored in a format that can be searched by acomputer.

The purpose of the Equivalent File of the present invention is topaginate the PTO Text File so that the data in the Text file can bepresented in a paginated patent-like format, thus facilitating searchingin, and direct citation from the text, a function heretofore notavailable using the PTO Text Files. The pagination process formats thePTO Text File with correct column breaks, column numbers, end of hnebreaks, and line numbers, thus allowing direct citation, along with thebenefits of pure text searching. The information contained in theEquivalent File can be used in both a familiar visual format by a humanbeing and automatically by the computer at the same time.

A "synchronized" display is a method of navigating an Equivalent Fileand the corresponding Image File in a way that a user can view a columnin the Equivalent file and the same column in the Image Filesimultaneously. For example, when the user views column 3 of theEquivalent File in a window, he can simultaneously view column 3 of theImage File in another window. Thus, the user can view two files, anEquivalent File and an Image File, in a synchronized manner.

An "unsynchronized" display is a method of displaying one portion of anEquivalent File and another portion of an Image File asynchronously. Forexample, assume there is a sentence in column 2 of the Equivalent Filestating "referring to FIG. 5, the system illustrates . . . ." If a userselects the sentence in the Equivalent File, the Image File will displaythe first page which contains FIG. 5. Thus, the Equivalent File and theImage File do not refer to the same column, but they refer to therelated matters. Another example of an unsynchronized display isdisplaying one portion of the Equivalent File while displaying acompletely unrelated drawing, an unrelated table, or a different textportion of the Image File of the same patent or an Image File of anotherpatent. Accordingly, in an unsynchronized display, there may be norelationship or linkage between the Equivalent File and the Image Filedisplayed simultaneously.

The underlying structure of the information stored in the EquivalentFile may be stored in many forms. It may be stored in a binary structureformat for fast access by a language that implements structureoperations such as the C programming language. Another alternative is tostore some of the underlying structural information about the text in ageneralized markup language such as SGML (Standardized GeneralizedMarkup Language) and store the raw positional information in a binarystructure format. There are many alternatives having their own impact oncapabilities, speed, and ease-of-use of the present invention. Thereader may therefore implement the present invention in the particularprogramming language which best accommodates the reader's systemrequirements. As previously described, the present invention may beimplemented using a variety of computer systems, including the systemshown in FIG. 3.

The SGML may be used in a variety of applications. The SGML may be usedto write a patent application that is equivalent in appearance to apublished patent. The SGML may be also utilized to create a compounddocument that contains both the Equivalent File and bit scanned imagesof tables, flow charts, equations and the like.

Equivalent Files are associated with at least the following types ofsynchronization information:

1. Column

The positions within the PTO Text File of the first character of eachpatent text column as those columns are displayed in the PTO Image File.This permits the present invention to determine which ASCII text isdisplayed in each column of the main body of the patent.

2. Line

The positions within the PTO Text File of the first character of eachline of text as those lines are displayed in the PTO Image File. Thispermits the present invention to determine which ASCII text is displayedin each line of each column of the main body of the patent.

3. Column Line Number

The approximate line number in the patent column that each line of textin the PTO Text File is adjacent to, permitting the present invention todetermine the approximate vertical positions of the ASCII text linesdisplayed in each column of the main body of the patent.

4. Bibliographic formatting

The approximate arrangement of the bibliographic data from the PTO TextFile as it appears on the bibliographic page images in the PTO ImageFile.

5. Graphic Item Locations

The locations in the PTO Image File of the various figures, figureelements, equations, non-text tables, structures and diagrams referredto in the PTO Text File.

6. Sections

The positions within the PTO Text File of the various logical sectionsof the document (e.g., background of the invention, brief description ofthe drawings, the claims section, etc.) as they are displayed in the PTOImage File.

7. Font

The font style in which the various ASCII characters in the PTO TextFile are displayed in the PTO Image File.

8. Point Size

The font size in which the various ASCII characters in the PTO Text Fileare displayed in the PTO Image File.

9. Superscript or Subscript

Whether the various ASCII characters in the PTO Text File are displayed10. Boldnesipts or subscripts in the PTO Image File.

10. Boldness

The degree of boldness of the font style in which the various ASCIIcharacters in the PTO Text File are displayed in the PTO Image File.

11. Italics

The degree of italicness of the font style in which the various ASCIIcharacters in the PTO Text File are displayed in the PTO Image File.

12. Special Characters

Some of the ASCII characters in the PTO Text File are displayed in thePTO Image File as special characters. Typically a group of characters inthe PTO Text File (e.g., "Ω") will map to one special character in thePTO Image File (e.g., "Ω"). This is due to the ASCII standard notdefining many special characters that are useful.

As an example of the "Column" information listed above, refer to theparagraph of text from the main body of U.S. Pat. No. 5,165,027 thatbegins with "Numerous techniques are used . . . ". FIG. 6 shows how theASCII characters for this paragraph are stored in the PTO Text File. Thesame paragraph is displayed in the PTO Image File for U.S. Pat. No.5,175,027 in FIG. 7.

It should be noted that the paragraph in the PTO Text File (see FIG. 6)is 5 lines long, and that the same paragraph displayed in the PTO ImageFile (see FIG. 7) is 7 lines long. In addition, no words are brokenacross fines in the PTO Text File. Words at the ends of lines displayedin the PTO Image File may be spht so that part of the word appears atthe end of one line, followed by a hyphen, and the rest of the wordappears on the next line (e.g., "performance").

The Equivalent File is associated with line numbers to identify which ofthe ASCII characters in the PTO Text File fall in which lines displayedin the PTO Image File. For example, the Equivalent File would store thelines of the paragraph in the PTO Image File (see FIG. 7) beginning withthe following characters in the PTO Text File:

Line 1: The "N" in "Numerous".

Line 2: The "m" in the middle of "performance".

Line 3: The "d" in "development".

Line 4: The "T" in "The".

Line 5: The "p" in "part".

Line 6: The "s" in "some".

Line 7: The "t" in "that".

As an example of the "Column" information listed above, refer to thefirst page image of the specification of U.S. Pat. No. 5,165,027, shownin FIG. 8. As illustrated, the first column of the patent begins withthe "M" in "MICROPROCESSOR". The second column of the patent begins withthe "d" in "data". These positions are stored in the Equivalent File inorder to identify which of the ASCII data in the PTO Text File fallswithin which columns.

FIG. 8 also shows an example of the "Column Line Number" informationlisted above. The column of numbers that runs down the middle of thepage indicate what line numbers within the patent text columns each ofthe lines of text falls on. For column 1, shown in FIG. 8, the line thatcontains "This application is a continuation of application Ser." isline 4 of the column. In column 2, shown in FIG. 8, the line that shows"address at which a breakpoint is to occur. A second" is line 8 of thecolumn. This information is associated with the Equivalent File toidentify the approximate vertical position on the page image where agiven line of text appears.

As an example of "Bibliographic formatting" information, reference ismade to FIGS. 4 and 5. Notice that the title record which starts with"TTL"has its data displayed in bold below the words "United StatesPatent", the inventor's name, and a horizontal ruler line. Each piece ofbibliographic information is stored as a column of text in theEquivalent File.

Paginating the bibliography data from the PTO Text File to theformatting on the bibliography pages in the PTO Image Files alsoinvolves adding text labels to the Equivalent File. For example, thecharacters "United States Patent 19!" that appear at the top of everybibliography page are not found anywhere in the PTO Text File. Thesewords appear at the top of every patent so their presence in the PTOImage File is unnecessary. However, in order to create an EquivalentFile that is similar in appearance to the PTO Image File, these wordsmust be specified in the Equivalent File. The pagination algorithm isdesigned to add these text labels when they are needed.

Extraction

The extraction process of the present invention is illustrated in blockdiagram form in FIG. 9. The PTO provides the PTO Text File and PTO ImageFiles on IBM® 3480 magnetic tapes. The extraction process identifies theparticular IBM® 3480 tape that a specific PTO Text File or a PTO ImageFile is located in, extracts those files from the tape(s) and convertsthem for use by the processing system which synchronizes and indexes thefiles.

PTO Text Tapes are issued by the PTO on specific calendar dates andcontain a unique Volume Serial Number (VSN). All patents issued on acertain date should be present in the tape(s) issued on that date. Thetapes do not contain an index. Therefore, extracting a specific PTO TextFile requires that the entire 200 MB IBM® 3480 tape be read into amagnetic disk buffer and stripped of header blocks, tape marks labels,etc., and then parsed to create a Volume Table of Contents (VTOC). TheVTOC contains the document number, byte count offset from the beginningof the tape, and the length of the document file in bytes. A separateprogram may then be used to index the beginning byte of the file andcopy the file segment to another file, which then becomes the PTO TextFile for the specific patent. It is possible for a PTO Text File to spanmultiple PTO Text Tapes. When this happens, a procedure is utilized bythe present invention to concatenate the multiple file segmentstogether. The VTOC created from the magnetic disk buffer is used toupdate a Relational Database System (RDB) for future reference, and thebuffer is then erased.

PTO Text Files are stored on the magnetic tapes in uncompressed format.PTO Image Files are stored on the magnetic tapes preferably incompressed format, preferably in Group 4 2D (two dimensional) faxformat. According to the present invention, PTO Text files are processedin uncompressed format. However, PTO Image files are processed at leastin part in compressed format. The processing of Image Files according tothe present invention is generally depicted in FIG. 75.

An example 2D compressed Image is shown in FIG. 75 as block 7506.According to the present invention, the 2D compressed Image 7506 isconverted to a 1D compressed Image 7508. Many functions performed by thepresent invention involve processing this 1D compressed Image 7508.(With some operations, such as with zooming and with pagination, the 1Dcompressed image 7508 is decompressed to an uncompressed format, asrepresented by item 7510 in FIG. 75. Typically, this uncompressed Imagefile contains 2320 bits by 3408 bits. Zooming and pagination isdiscussed below.) In contrast, conventionally such functions areperformed by solely processing uncompressed images.

The structure of a 1D compressed Image shall now be described withreference to FIGS. 76A and 76B. FIG. 76A illustrates a representation ofan example uncompressed Image 7602. A typical line 7604 in thisuncompressed Image 7602 is shown. This line 7604 includes a number ofblack spaces (each black space representing a logical 1 bit) and anumber of white spaces (each white space representing a logical 0 bit).

A 1D compressed Image 7606 corresponding to the uncompressed Image 7602is shown in FIG. 76B. This 1D compressed Image 7606 includes a line 7608(called the compressed line 7608) corresponding to the line 7604 (calledthe uncompressed line 7604) in the uncompressed Image 7602. Thecompressed line 7608 represents the uncompressed line 7604 byquantifying the number of black and white spaces in the uncompressedline 7604, while preserving the sequence of such black and white spaces.Thus, as indicated in the compressed line 7608, the uncompressed line7604 contains 128 black spaces, followed by 64 white spaces, followed by8 black spaces, followed by 64 white spaces, followed by 102 blackspaces, followed by 90 white spaces.

Procedures for converting between uncompressed Images, 2D compressedImages, and 1D compressed Images will be apparent to persons skilled inthe relevant art. Such procedures are discussed in many publiclyavailable documents, such as Federal Information Processing StandardsPublication No. 150, entitled "Facsimile Coding Schemes and CodingControl Functions for Group 4 Facsimile Apparatus," Nov. 4, 1988,incorporated herein by reference in its entirety.

Initial Automatic Pagination

The initial automatic pagination process is illustrated in flow chartform in FIG. 10. The automatic pagination process utilizes the PTO TextFile and creates an Equivalent File that is an initial approximation ofthe formatting of the original published patent.

The steps of initial pagination of the present invention are as follows:

1. Read the PTO Text File into memory of a computer system (for example,a computer system of the type shown in FIG. 2 may be utilized).

2. Assign each of the ASCII data records that begins with abibliographic information 1D code, an approximate location on thecorresponding image page of the PTO Image File at which its data shouldbe displayed. See the document "U.S. Patent and Trademark Office PatentFull-Text/APS File" for a listing of all the bibliographic data record1D codes. Also, see the document "Patents and Trademarks Style Manual"for a specification of how bibliography information is formatted onbibliography pages.

3. Process each of the paragraphs of the main body of the patent. Builda list of the locations of the Logical Groups that are found (see thedocument "U.S. Patent and Trademark Office Patent Full-Text/APS File"for a listing of the Logical Groups that can appear in the main body ofthe patent, i.e. "GOVT", "PARN", "BSUM", "DRWD", "DETD", "CLMS","DCLM").

4. Save the pagination information to disk in an Equivalent File.

In steps 2 and 3 above, the paragraph formatting procedure is performedwhenever there is a data value that might span more than one line in thecorresponding page image in the PTO Image File. In addition, theautopagination technique may be utilized on compressed data.

The manner in which the autopagination technique may be utilized oncompressed data shall now be described. As discussed above, the presentinvention generates a 1D compressed image file from the uncompressedimage file provided by the PTO. According to an embodiment of thepresent invention, pagination is performed using the uncompressed PTOtext file and the 1D compressed image file. This embodiment is describedbelow with reference to a flowchart 7802 shown in FIG. 78. Flowchart7802 begins with step 7804, where control immediately passes to step7806.

In step 7806, clumps are identified in the 1D compressed image file. Aclump is a group of dark spaces (each "dark space" representing alogical "1" value) that are adjacent to one another either vertically(between lines) and/or horizontally (within lines) and/or diagonally. Inan alternate embodiment, a clump can represent a group of white spaces.The operation performed in step 7806 is called "segmentation".Conventionally, segmentation is not performed using compressed dataimages. Instead, segmentation is conventionally performed usinguncompressed data images. According to such conventional procedures, itis necessary to search an uncompressed data image in the vertical,horizontal, and diagonal directions. However, since the presentinvention uses 1D compressed images, it is only necessary to search inthe vertical and diagonal directions (this is assuming that clumping isdone vertically, horizontally, and diagonally; if clumping is only donehorizontally and vertically, then the present invention searches onlyvertically). This is the case since 1D compressed images are alreadyclumped in the horizontal direction (this is apparent from FIG. 76B).Thus, the use of the present invention of 1D compressed imagessignificantly decreases the processing time to perform segmentation.

Preferably, the present invention in step 7806 searches for dark spacesin adjacent rows which vertically overlap. Consider an example 1Dcompressed image 8502 shown in FIG. 85A, where two rows 8504 and 8506are shown. Row 8504 has 2 dark spaces, followed by 3 white spaces,followed by 2 dark spaces, followed by 1 white space. Row 8506 has 3dark spaces, followed by 1 white space, followed by 2 dark spaces,followed by 2 white spaces. The present invention generates a table 8508shown in FIG. 85B from rows 8504 and 8506. Table 8508 containsinformation that denotes the boundaries between groups of white and darkspaces in rows 8504 and 8506. Table 8508 contains an entry for each rowin the compressed image 8502, such as entries 8510 and 8512 thatcorrespond to rows 8504 and 8506, respectively. Entry 8510 is derived byadding each value in row 8504 with the preceding value or sum. Thus, the"5" in entry 8510 is derived by adding "3" plus "2" from row 8504. The"7" in entry 8510 is derived by adding "2" from row 8504 plus "5" (i.e.,The prior sum). Each entry in table 8508 is generated in the same way.

Once table 8508 is generated, clumps are identified by analyzing thedark space boundary information contained in entries 8510 and 8512. Forexample, the dark space boundary information contained in entry 8510indicates that row 8504 has dark spaces in bit positions 1-2 and 5-7.The dark space boundary information contained in entry 8512 indicatesthat row 8506 has dark spaces in bit positions 1-3 and 4-6. Bitpositions 1-2 vertically overlap bit positions 1-3. Thus, these darkspaces in rows 8504 and 8506 represent at least a part of a clump. Also,bit positions 5-7 vertically overlap bit positions 4-6. Thus, these darkspaces in rows 8504 and 8506 represent at least a part of another clump.This analysis is performed for all of the entries in the table 8508.Note that it was possible to identify these clumps based on the darkspace boundary information contained in table 8508.

Each of the clumps identified in step 7806 may represent a character. Instep 7808, the clumps are compared to character templates. The charactertemplates are bit patterns corresponding to characters, such asalphanumeric characters, punctuation characters, graphical characters,etc. Thus, in step 7808, the clumps are compared to character templatesfor the purpose of recognizing the clumps as characters.

The operation performed in step 7808 is called "template matching."Preferably, template matching is performed by finding the center ofgravity of the clump being processed (each clump is processed, i.e.,matched, in turn) and the center of gravity of each template (thecenters of gravities of the templates are preferably calculated inadvance). The center of gravity is defined as the (x,y) location wherethe x coordinate in this (x,y) location is equal to the average of allof the x coordinates in the dark spaces of the clump, (the terms"spaces" and "pixels" are used interchangeably herein) and the ycoordinate in this (x,y) location is equal to the average of all of they coordinates in the dark spaces of the clump. Then, the clump isaligned with a template (each template is processed in turn) such thatthe center of gravities of the clump and the template coincide. Thenumber of pixels in the template and the clump having the same value isthen determined. Consider, for example, the pixels at the center ofgravity of the clump and the template. If they are both equal to 1, orare both equal to 0, then the sum is incremented by 1. Otherwise, thesum is not incremented. This comparison operation is performed for eachpixel in the clump and the template. Then, the sum is divided by thetotal number of pixels in the smallest rectangle enclosing both theclump and the template. If this resulting quotient (also called score)is above a predetermined threshold, then the clump is said to match thetemplate and is recognized as that character represented by thetemplate. Preferably, this predetermined threshold is approximately 90%,although other values could alternatively be used, and could vary fromtemplate to template. The above analysis is performed for each templateuntil the clump is recognized. It should be noted that not all clumpsare recognized.

In one embodiment, the character templates have been previouslycompressed such that they are 1D compressed character templates. Such 1Dcompressed character templates are compared to the clumps in step 7808.Alternatively, the character templates are not compressed. Instead, theclumps are decompressed, and are then compared to the-uncompressedcharacter templates in step 7808.

In step 7809, page parsing is performed. With respect to patentdocuments, the present invention first locates column numbers (appearingat the top of columns in patents) in the processed image file. This isdone by looking for clumps which have been recognized in the previousstep as large-sized numbers in the processed image file. Then, thepresent invention locates the patent number which appears in large-sizednumbers at the top of each page in a patent. As will be appreciated, PTOImage files include a series of line numbers (i.e., 5, 10, 15, 20, etc.)between the left and right columns on each page of text. The presentinvention searches for these line number sequences to identify thesecolumns. The present invention uses this information to identify whichclumps are in which columns. These clumps are assigned sequentialposition numbers, preferably starting from 1. Similarly, the charactersin the PTO text file are assigned sequential position numbers,preferably starting from 1. As described below, these position numbersare used to compare the processed image file with the PTO text file formatching purposes.

In step 7810, lines of characters (such characters having beenrecognized in step 7808) are identified. Step 7810 may be performedusing any well known line recognition technique. One such linerecognition technique operates by processing each character in turn. Ifthe center of a character is between the top and bottom of the previouscharacter, then the two characters are considered to be in the sameline. For reference purposes, the lines of characters recognized by theabove-described operations are called the processed image file.

In step 7812, the present invention matches the PTO text file to theprocessed image file. The purpose of this matching operation is toidentify the ends of lines, columns, and pages in the processed imagefile, and to then reflect such ends of lines, columns, and pages in thePTO text file to thereby generate the Equivalent File. The EquivalentFile is synchronized to the Image File on a line/column/page basis.

For example, suppose the PTO text file includes the following sentence:"The present invention includes a computer platform." In step 7812, thepresent invention matches each word in this sentence to words in theprocessed image file. Suppose that the word "computer" in this sentenceis presently being analyzed. The word "computer" from the PTO text fileis matched with an identical word in the processed image file. If thisword is at the end of a line in the processed image file, then thepresent invention reflects this end-of-line information in theEquivalent File. Similarly, if this word is at the end of a column orthe end of a page in the processed image file, then the presentinvention reflects this end-of-column/end-of-page information in theEquivalent File.

In one embodiment, Step 7812 is performed as follows. First, uniquepairs of adjacent characters (not counting spaces) are identified in thePTO text file. These character pairs may include overlapping charactersin words. Second, a look up table having an entry for each characterpair is created. The positions in the PTO text file where the characterpairs are located are stored in the respective entries of the table.Third, unique pairs of adjacent characters (in the horizontal direction)in the processed image file are identified. These character pairs fromthe processed image file, which are also called anchor pairs, mayinclude overlapping characters in words.

Processing then continues to map the anchor pairs to the characters inthe PTO text file. An anchor pair table is created having an entry foreach anchor pair. These entries include the position information fromthe lookup table associated with the PTO text file for the anchor pairs.Then, positions from this anchor pair table corresponding to impossiblesequences of characters are eliminated.

For example, a portion of an example PTO text file 8608 is shown in FIG.86. A processed image file 8606 corresponding to this PTO text file 8608is also shown. Only the clumps identified in step 7808 are shown in FIG.86. The lookup table for the PTO text file 8608 is shown as item 8610.Item 8612 represents the anchor pair table before positional informationis deleted. Such positional information is deleted as follows. The firstanchor pair, in this case "Th", is selected. The left-most position (inthis case, the only position) of this anchor pair is position 1. Theother anchor pairs are then evaluated with respect to this anchor pair"Th" to determine whether their positions (in the PTO text file) canpossibly correspond to the anchor pairs. First, the anchor pair "he" isevaluated. This anchor pair occurs at positions 2, 5, and 14 in the PTOtext file 8608. The anchor pair at "he" can occur only at position 2with respect to the anchor pair "Th", since it is known that "he" is inthe same word as "Th" (this information is known since "Th" is veryclose to "he" in the processed image file). Accordingly, positions 5,14, and 25 are deleted. Searching is performed in both a forward and abackward direction. Consider the case where the anchor pair "ab" isselected. The anchor pair "th" can occur only at positions 4 and 13 withrespect to anchor pair "ab", since anchor pair "ab" appears after anchorpair "th" in the processed image file 8606 (at least with respect to theclumps identified in the processed image file 8606). Each anchor pair isselected, and then the other anchor pairs processed with respect to theselected anchor pair, in both a backward and forward direction. After asmany of the positions from the anchor pair table 8612 have been deleted,it is possible to match the processed image file 8606 to the PTO textfile 8608 to identify end-of-lines in the PTO text file 8608.

Flowchart 7802 is complete after step 7812 is performed, as indicated bystep 7814.

The automated pagination feature of the present invention as describedabove results in an Equivalent File that is synchronized on a line,column, and page basis. In alternate embodiments, the text file isautomatically paginated such that the Equivalent File is onlysynchronized on a page basis, a column basis, a line basis, or anycombination of the above.

Pagination Correction Tool

The pagination correction tool allows a human to check and correct theresults of the initial automatic pagination process. A computer systemof the type illustrated in FIG. 3 may also be used. This tool is asoftware program with a graphical user interface that provides thefollowing capabilities, and completes the following steps:

Open and read into memory a PTO Text File;

Open and read into memory a previously edited Equivalent File on media;

Use a cursor control device (for example, mouse 42) to mark or urunarkcharacters that begin a patent column.

Use a cursor control device (for example, mouse 42) to mark or urunarkcharacters that begin lines within a patent column.

Add or remove blank lines to set the appropriate vertical line spacingso that the lines of text fall on the same line numbers of the patenttext column they are in as shown in the PTO Image File;

Use a cursor control device to mark or unmark paragraphs as beingsection titles.

Indicate which figures are on which drawing sheets.

As is typical in computer programs, the specified tasks listed above donot need to be performed in any particular order except that the filemust be opened at the beginning and closed (and usually saved) at theend.

In an alternate embodiment, automatic pagination is not performed.Instead, pagination is entirely, manually performed by using thepagination correction tool. This embodiment is particularly useful whenit is only necessary to synchronize on a page basis, or a column basis,for example. For reference purposes, such page basis synchronization,column basis synchronization, etc., are collectively calledsynchronization levels.

In other embodiments, the operator is provided with an option ofautomatic pagination or manual pagination, or any combination of thetwo. This embodiment is represented by a flowchart 7702 shown in FIG.77. In step 7710, the operator may select automatic pagination or manualpagination. If the operator selects automatic pagination, then step 7712is performed, wherein pagination is performed automatically, asdiscussed above. After step 7712 is performed, or if the operator didnot select automatic pagination in step 7710, then step 7714 isperformed. In step 7714, the operator uses the pagination correctiontool to perform manual pagination.

Indexing

A B+-tree inverted index of words is generated for a group of one ormore Equivalent Files to greatly speed the process of searching the textof the files. These indexes are built from all the words in the PTO TextFile. The index generator ignores end-of-line hyphens when buildingindexes but does not ignore hyphens in the middle of lines.

The present invention utilizes the following build/search indextechnique: when indices are built, all punctuation marks in a text fileare stripped, and the resulting alphanumeric words are enteredindividually into an index database. The word position in the text fileis also stored. For example, a string such as " Ax,Bx,Cx!" is convertedto three separate words--"Ax", "Bx" and "Cx", and the individual wordsare entered into the index database as three separate items.

When a user enters a search string such as " Ax,Bx,Cx!", the string isconverted into the following tokens: "Ax", "Bx" and "Cx". The tokens aresearched using the text conversion technique described above, and theresulting search produces three lists of search matches. These lists areprocessed and filtered for all occurrences. The occurrence of "Ax" isimmediately followed by the occurrence of "Cx". This technique allowswords originating from the source text to be searched directly, withoutthe need to store a large number of punctuation mark locations.

User Interface

The graphic user interface of the present invention is comprised in partof a computer program which is stored in either mass memory 60, CD-ROM62, or floppy disk 64, of the system illustrated in FIG. 3. Appropriateprogramming code is loaded into memory 55 by the I/O circuit 50 andexecuted by the CPU 52. It will be appreciated that the computer programof the present invention may also be stored in random access memory(RAM), or in other machine readable form and media. The graphic userinterface displays the Equivalent File and the PTO Image File, describedabove in previous sections, and provides a variety of viewing andediting options.

Referring now to FIG. 11, the display screen 68 is shown in detail.Illustrated within the display 68, is a title bar 100 for identifyingthe title of the program in which the user interface of the presentinvention is utilized. In the example of FIG. 11, the title of theprogram is PatentWorks Workbench™, however, depending on the nature ofthe program in which the present invention is used, the title may changein accordance with the particular application. In addition, a menu bar102 is provided which includes a plurality of command options such as"Case", "Edit", "Patent", "Note", "Library", "View", "Window", and"Help". Additionally, other context specific command options may bedisplayed depending on the specific application in which the presentinvention is used.

As illustrated in FIG. 11, a tool bar 103 is displayed immediately belowthe menu bar 102. The tool bar 103 comprises the primary source ofoptions and selection items which a user of the present invention willcommonly access. As will be described more fully below, the tool bar ofthe present invention includes a briefcase icon 106, a direction button107 for dropping a list of available cases, a light bulb icon 108 fordesignating a patent to a case, and a direction button 109 for obtaininga list of all patents or other documents which may be displayed in anEquivalent File format from a case. Additionally, a library icon 110 isprovided on the tool bar 103, the selection of which provides a listingof all the patents available in the patent library. A magnifying glassicon 112 is displayed for selecting a search box to appear on thedisplay screen. A target icon 113 is provided for identifying searchresults. Other icons displayed along the menu bar 103 include a printericon 115 for printing documents. A case note icon 125 for displayingcase notes is also provided on the tool bar 103. A patent note icon 126and a direction button 127 are also provided for reviewing and accessingpatent notes.

The specific functions and operations of these various icons and commandoptions displayed on the menu bar 102 and tool bar 103 will be describedmore fully below. It will be noted that all tool bar icons or buttonfunctions have keyboard equivalents designed to allow the user toperform the functions of the icons and button functions without usingthe cursor control device. All of the functions of the tool bar 103 arealso displayed in the menu bar drop down menus. Additionally, as shownin FIG. 11, two instructional arrows 129 and 130 are displayed on screen68. The instructional arrows 129 and 130 provide initial instruction tothe user to begin work using the interface of the present invention.These instructional arrows may be selectively turned on or off by theuser. Moreover, as shown in FIG. 11, in the lower left hand comer of thescreen is a minimized image of the current library identified as the"Detkin" library.

Referring again to FIG. 11 and FIG. 3, in accordance with the teachingsof the present invention, a user may access various functions by placingthe cursor 44 over a command option on the menu bar 102 and signalingthe CPU 52 using the mouse 42 or keyboard 56. A variety of methodologiesmay be employed for the selection of subcommand items illustrated in thevarious drop down menus once a command option on a menu is selected. Thepresent invention operates independently of the particular methodologyfor function selection employed by the computer system illustrated inFIG. 3.

As illustrated in FIG. 12 and FIG. 3, the placement of cursor 44 overthe direction button 107, and the activation of button function 107using either the mouse 42 or the keyboard 56, results in the display ofa list 132. The list 132 lists all the cases in the system. In theexample illustrated in FIG. 12, there is currently one case which existsin the "System" library, and which is referred to as "demonstration". Itwill be appreciated that if the "System" library includes additionalcases, then the names of these cases will be displayed in the list 132as well. As illustrated in FIG. 13, the selection of the case referredto as "demonstration" results in the display of the case name along thetool bar 103. Additionally, the instructional arrows of the presentinvention provide guidance to the user as to available options which maybe selected in the current state of the user interface. For example, inFIG. 13, instructional arrow 139 instructs the user to click on downarrow 109 to open and view the list of patents already disposed withinthe case "demonstration". Instructional arrow 140 informs the user thathe may click on the library icon 110 to open the library and add patentsto the case "demonstration". A case may contain patents from severallibraries.

Referring now to FIG. 14, assume for example that a user activatesbutton function 109 which results in the display of patents within thecase "demonstration". Assume further that the user has selected U.S.Pat. No. 4,760,478 (hereinafter . . . 478"). As shown in FIG. 14, thecomputer system illustrated in FIG. 3 displays the Equivalent File ofthe '478 Patent in an equivalent window 160. As previously described inthe specification, the Equivalent File of the '478 Patent was generatedin accordance with the teachings of the present invention, including theprocesses of extraction, synchronization, indexing and the like ashereinabove described. Additional features of the equivalent window 160,and its operation in conjunction with other functions of the presentinvention will be described in more detail below. Also shown in FIG. 14,is a Patent Text Toolbox 162. A downward arrow button function 165 hasbeen activated in FIG. 14 by a user, resulting in the display of a dropdown list 170. As shown, the drop down list 170 includes a listing ofthe sections of the Patent '478 displayed in the equivalent window 160.

As will be described, a user may quickly navigate from section tosection by selecting one of the various sections of Patent '478displayed in list 170. For example, in FIG. 14, the bibliography sectionhas been selected. In response to the selection of the bibliographysection by the user, the CPU 52, illustrated in FIG. 2, displays thebibliography portion of the Equivalent File in the equivalent window160. A user may verify that the bibliography section is currently beingdisplayed within the equivalent window 160, by observing a letter "B"(referred to by numeral 175) displayed along the right edge of theequivalent window 160. In the present example, the letter "B" indicatesthat the text displayed within the equivalent window 160 corresponds tothe bibliography section of the patent. Additional features andfunctions of the equivalent window 160 will be described more fullybelow.

As illustrated in FIG. 15, the activation of the library menu on menubar 102 results in the display of a library drop down menu 150. The menu150 includes a variety of command items including "Open Library","Search", "Case Cross Reference", and "Set Library Directories". Forpurposes of this specification, a description of the various functionsof the present invention will be set forth below. However, it will beappreciated by one skilled in the art, that the operation of the presentinvention is dynamic, and that a particular order or sequence of eventsillustrated herein is only one of a variety of possible sequences ofimages and operations which the present invention is capable ofperforming. Since the present invention comprises a graphic userinterface which permits an operator to interact with the computer systemillustrated in FIG. 3, the particular sequences of operations anddisplays generated herein, are dependent upon the computer systemillustrated in FIG. 3 in cooperation with the human operator.

Referring now to FIG. 16, assume for example that a user selects thefunction "Set Library Directories" from the library drop down menu 150illustrated in FIG. 15. In response to the selection of the SetLibraries Directories command, the CPU 52 in FIG. 2 displays a SetLibrary Directories dialog box 175 as shown in FIG. 16. The Set LibraryDirectories dialog box 175 permits a user to define the directories thatcontain libraries, and the directory used when creating new libraries.The Set Library Directories dialog box 175 includes a variety of dialogbox options. As shown in FIG. 16, a directories window 180 displayscurrent directories available to the user. A user may place cursor 44 inFIG. 3 over a desired directory, and signal the computer to select thedirectory using the keyboard 56 or the mouse 42 in FIG. 3. After theselection of a directory, the directory may be added to the path listusing the add directory button function 185. By double clicking on adirectory to open the directory, the directories contained in theselected directory are then listed below the selected directory within awindow 180. Additionally, various drives may be selected such as the CDdrive 62 (see FIG. 3). An icon representation of the CD drive 62 is alsodisplayed in the Set Library Directories dialog box 175. For example, inFIG. 16 the CD drive 62 is represented by an icon 200. By clicking on anicon to select it, all directories contained on the selected drive arethen listed in the directories list displayed in window 180. Asillustrated, the currently selected directory is also identified withinthe Set Library Directories dialog box 175. The currently selecteddirectory may be a directory selected from the directories list or pathlist. Once a user has selected a directory, the activation of the adddirectory button function 185 adds the directory to the list ofdirectories containing libraries in the window 190. A button functionRemove Directory 205 removes a selected directory from the path list.Once removed, the directory is no longer used in searches for availablelibraries. The Set Library Directories dialog box 175 also includes aSet as Default button function 210. The Set as Default button 210 setsthe currently selected directory as the default directory. When settingdirectories for libraries, the directory set as a default is thedirectory where new libraries are created. As illustrated in FIG. 16,the Set Library Directories dialog box 175 also identifies the currentdefault directory.

Referring now to FIG. 17, selecting a New Library option from menu 150in FIG. 15, results in the display of a New Library dialog box 225.Using the keyboard 56 in FIG. 3 or other input device, a user may theninput a library name into an open field 230 for the new library, andcreate a new library using the inputted name.

The selection of the Open Library sub-command item from menu 150 in FIG.15 results in the computer 48 in FIG. 3 generating and displaying anOpen Library dialog box 235, illustrated in FIG. 18. As shown in FIG.18, the Open Library dialog box 235 includes a field 237 which displayspreviously created libraries in reference to only those libraries foundin directories specified in the window 190. Additionally, the librariesare identified by name, for example, an "aaa" library 240 and a "bbb"library 242, are illustrated within the field 237. A scroll bar (notshown) is provided to permit the user to scroll through the variouslibraries in the event there are more libraries than can be displayedwithin the field 237 at any one time. The particular mechanism forscrolling will not be disclosed herein, since the scrolling of textualwindows is known in the art.

In the presently preferred embodiment, referring to FIG. 18 and FIG. 3,a library is selected by the user by placing the cursor 44 over alibrary icon 238, or the name of the library, and double clicking switch46 on the cursor control device 42. Alternatively, by placing cursor 44over the name of the library or icon 238 and clicking switch 46 a singletime, the library is highlighted. The library may then be selected byplacing the cursor 44 over the OK button 250 and clicking switch 46. Asshown in FIG. 18, once a library is selected (in the example of FIG. 18,the library "bbb"), the computer 48 highlights the name of the library.As used in this specification, a "library" contains a collection ofelectronic patents, which includes the Equivalent File and PTO ImageFile of each listed patent.

Referring now to FIG. 19, the selection of a library (in the presentexample, the "bbb" library) results in the computer 48 in FIG. 3generating and displaying a Library content dialog box 260. As shown,box 260 includes a field 262 in which all patents comprising theselected library are listed. In the example illustrated in FIG. 19, the"bbb" Library includes only four patents (U.S. Pat. Nos. 4,760,478;4,783,757; 5,073,969 and 5,165,027). Additionally, all patents disposedwithin a library are identified by a light bulb icon 265 as well as thepatent number as shown in the figure. Additional patent specificinformation may also be included such as inventor's name, assigneeinformation, patent titles, etc. Box 260 further includes a patentlibrary icon 270 to denote box 260 as comprising library patents, asopposed to case patents which will be described below. Box 260 furtherincludes a variety of other button functions, such as a "remove" button274 to remove a patent from the library, a "create a new case" button276 for creating a new case from within the library box 260, and an "addto" button 278 for adding patents to a case.

In addition, as illustrated in FIG. 19, box 260 includes a "select all"button function 284 to choose all patents in the library window 262 toadd. A user may select a single patent to view, in which case, thecomputer displays the patent Equivalent File in the equivalent window160 illustrated in FIG. 14. The selection of an OK button 286 dismissesthe window 260 and executes the function which the user has selected.

As shown in FIG. 20, the selection of Patent '478 by the user results inthat patent being highlighted on the display. Box 260 further includes adownward arrow button 290 to minimize box 260 into the library icon270,. as shown in FIG. 21. To minimize box 260, cursor 44 of FIG. 3 isplaced over button 290, and the mouse 42 of FIG. 3 is momentarilyclicked. Computer 48 of FIG. 3, sensing the momentary depressing ofswitch 46, minimizes the box 260 as shown in FIG. 21 as the library icon270 identified by the name of the library which has been minimized.

The selection of the Update Library sub-command item within menu 150 ofFIG. 15 results in the display of an Update Library dialog box 300illustrated in FIG. 22. It will be noted that the Update Library box 300refers to the current library in use, namely the Intel® Library in thepresent example. As shown in FIG. 22, the Intel® dialog box includes thename of the current library, the date of the update, as well as an OKbutton and a "cancel" button to cancel the box display. In the presentlypreferred embodiment, the Update Library sub-command may only beselected once a library has previously been selected.

Referring now to FIG. 23, the selection and operation of the Searchsub-command from the menu 150 of FIG. 15, or from the search icon 112,will be described. Upon the selection of the Search sub-command item,the computer 48 in FIG. 3 generates and displays a Search Library dialogbox 302 with the option to search in the current patent, currentlibrary, or current case. In operation, once the location of the searchhas been selected (for example, to search the current library), the userhas the option to input various search words in a search word field 304within the box 302, which may include Boolean terms such as "AND", "OR",and other logical search terms, such as for example, proximity searcheswithin five, ten or twenty five words of a selected word. Additionally,the user may recall previously saved searches (button function 309),view word lists from selected patents being searched (button function306), and save the current search (button function 308), or exit thedialog box by depressing a cancel button function. A variety of logicalBoolean terms are predefined as button functions such as AND, OR, aswell as other common search expressions to assist the user in definingthe search. As shown in FIG. 23, box 302 includes a word list buttonfunction 306. The activation of the cancel button 307 results in thedismissal of box 302.

Continuing to refer to FIG. 23, additional features of the Search dialogbox 302 will be described. As previously discussed, the Search dialogbox enables a user to conduct a search for a phrase or group of phrasesincorporating Boolean terms. The user may choose the current library,the current patent, or the current case from the menu option, as shownin FIG. 23. To conduct a search of patent notes, the reader is referredto the "find" command from the notes menu bar, to be described morefully below. Assume for example that a user desires to conduct a search.First, the user must select a desired scope of the search. The scopeidentifies how much information will be searched. The selection of the"current patent" in box 302 results in the currently active patent beingsearched. Upon completion of the search, all occurrences ("hits") of thesearch string will be highlighted in the text of the Equivalent Filedisplayed in the equivalent window 160 in FIG. 28. The user may alsosearch the current library by clicking on that option in box 302. If thecurrent library is selected, the search will encompass the entire patentlibrary currently open. Upon completion of the search, all patents inthe library found to contain the search string will be listed in theSearch Results dialog box which will be described with reference to FIG.26. If the user selected the "Current Case" by clicking on that optionin box 302, the currently open case will be searched. Upon completion ofthe search, all patents in the case found to contain a search stringwill be listed in the Search Results dialog box. It is also possible forthe user, using the teachings of the present invention, to search forwords containing sub-words or parts thereof. By entering words ornumbers to find in field 304 of the box 302, and activating a Searchbutton function 305, the search is initiated for the inputted wordsand/or numbers. A complex string search may be conducted by combiningwords with Boolean terms such as AND, OR, or proximity searches. As willbe described, a user may also select a saved search or select wordsusing a word list. A user may also select portions of text from theEquivalent Text File. A Clear button function 309 in the search librarydialog box 302 enables the user to clear current search words which havebeen defined in the search window 304. The activation of the Clearbutton 309 results in the clearing of the entire contents of the window304. Alternative methods for clearing are the use of a back space deletekey from the keyboard 56 in FIG. 3. In addition, words can be copied andpasted onto the field 304 of the box 302.

With regard to the Boolean expressions, as is well known, the "AND" termresults in all occurrences of a word AND a word to be searched. Forexample, searching the words "data AND device" will yield alloccurrences of the term "data (and all occurrences of the word) device"throughout the entire search scope. For example, searching the string"data OR device" will yield all occurrences of either "data" or"device". An asterisk option signifies a wild card in the search term.This allows searching using incomplete words or a word which may containadditional characters to the term being search. For example, searching"de*" will find all the words in the search scope that begin with "de",such as "device", "denote", etc. The use of proximity searches withinfive, ten, or twenty five words is also well known, and supported by theinterface of the present invention.

The Save Search button function 308 allows the user to save the searchin a separate table which may be retrieved later for viewing, editing,or adding into other searches by clicking on the "Get Saved Search"button function 309. The Get Saved Search button function 309 lists allpreviously saved searches. A user may select any of the previouslysearched items to view. The saved searches are listed in alphabeticaland chronological order. The activation of the button function Word List306 results in a list of all words which exist in the patent or library.

Referring to FIG. 24 and FIG. 3, the placement of cursor 44 over theWord List button function 306, and the momentary depression of switch 46on the mouse 42, results in the generation and display of a word listdialog box 310, illustrated in FIG. 24. The word list dialog box 310includes an alphabetized listing of all words which exist in theselected patent, or all patents in a library, with a numericalidentifier indicating the number of times the particular word occursappearing in the left hand column of the word list. For example, withinthe Detkin Library and the selected patent, the word "abandon" occursseven times. Similarly, the word "ability" occurs two times. By placingcursor 44 over a letter tab of the list, the user is able to quicklymaneuver through the Est of words. For example, as shown in FIG. 25, byplacing cursor 44 over the letter "D" and momentarily clicking switch46, computer 48 generates and displays a listing of all words beginningwith the letter "1D" within the selected patents within the field ofsearch identified in box 302. In addition, it will be noted that throughthe use of a scroll bar 311, the word list is scrollable. The selectionof the word "data" from the word list results in the word automaticallyappearing in the search window 304. Placing the cursor 44 over the "wildcard"(*) button function 314, permits the user to identify all wordswhich begin with the selected word and any additional charactersfollowing the word. For example, if the user has selected the letters"de" and attached a wild cards at the end of the word (for example,de*), then the search dialog box 302 will locate all words which beginwith the letters "de" and have any ending attached thereon. In addition,the "wild card"(*) may be placed in front of a suffix. For example, ffthe user enters the word "*tion", the search dialog box 302 will findall words which end with the letters "tion".

Assume for example that the user desires to initiate the searchidentified in FIG. 25 having the search word "data" in window 304 forthe current library. By placing the cursor 44 over the search button ofthe box 302 and clicking switch 46, computer 48 in FIG. 3 generates anddisplays a Search Results box 320, as shown in FIG. 26. The SearchResults box 320 lists the number of times the selected word occurs inall of the patents of the particular library. In the present example,the Detkin Library comprises twenty three patents that contain thesearch word, "data". The display area 329 shows the '233, '125, '742,'660, '262, '056 and '055 patents. The other sixteen patents will appearin the display over 329 as the user scrolls the area 329 down. As shownin FIG. 26, the search term "data" occurs 101 times in patent '233, and247 times in patent '262. Additionally, it will be noted that the SearchResults box 320 identifies the number of patents in which the searchterm was located, and the number of patents which are selected. The usermay now select a patent to view each of the occurrences of the word"data". Moreover, the Search Results box 320 includes a button functionto create a new case, or the user may add the patents to a current caseusing a separate button function.

Assume for example that the user desires to view the Equivalent File ofpatent '233. The user places the cursor 44 of FIG. 3 over a portion ofthe patent search result listed in box 320. For example, the user mayplace cursor 44 over the light bulb icon, the number of occurrences (inthe current example, 101) or any portion of the patent number, andmomentarily depress switch 46. The selection of a patent within box 320results in the computer 48 of FIG. 3 highlighting the selection.Double-clicking on the selection, or the activation of the View buttonfunction 325 results in computer 48 displaying the Equivalent File ofthe library version of the patent, and displaying all instances of thesearch term (in the present example, the word "data"). As illustrated inFIG. 28, in the presently preferred embodiment of the invention, theEquivalent File of the selected patent is displayed on display 68 ofFIG. 3 in the left hand portion of the screen 68 within an Equivalentwindow 160. The repetitive activation of the right arrow button 351results in the navigation to each "hit" of the search term in theequivalent patent text. Also illustrated in the figures as part of theequivalent window 160 are (+) button functions 372 and (-) buttonfunction 374 (see for example, FIG. 27). The activation of the plusfunction 372 or minus function 374 results in computer 48 of FIG. 3displaying the Equivalent File, in increments based on the columnnumbers. For example, the activation of the plus button 372 will resultin the next column of the patent being displayed in the Equivalentwindow 160. The activation of the minus button 374 results in adecrement so that the previous column of the patent is displayed.Additionally, the present invention supports the scrolling of theEquivalent File in window 160, such that the text may be scrolledthrough any displayed column.

As illustrated in FIG. 26, the Search Results dialog box 320 includes aSelect All button function 328. The activation of the Select All buttonfunction 328 results in the selection of all patents in the results listdisplayed in area 329. The selected patents can be added to the currentcase or create a new case with the search results displayed in area 329.

As illustrated in FIG. 27, the selection of the button functionLibrary/Case Cross-reference from menu 150 (see FIG. 15) results in thedisplay of a Library to Case Cross Reference dialog box 350. TheLibrary/Case Cross-reference dialog box 350 permits the user to view alist of all patents on the current library cross referenced to the casesin which they are utilized. As illustrated, the dialog box 350 "floats"over the other displayed windows, such as for example, the equivalentwindow 160, and may be selectively placed at any location on screen 68of FIG. 3, as may many of the present invention's dialog and controlboxes.

Assume for example that the user selects the view button function 325 ofthe Search Results dialog box 320 in FIG. 26. The computer 48 of FIG. 3displays the Equivalent File of the selected patent (in the example ofFIG. 28, the '478 patent) and highlights all instances of the searchterm occurring in the Equivalent File displayed in equivalent window160. Additionally, computer 48 generates and displays the Patent TextToolbox 162, which takes the form of a window which may selectively bemoved and retained anywhere on screen 68 in FIG. 3. As shown, PatentText Toolbox 162 includes a number of button functions including a rightarrow button function 351, a left arrow button function 352, a patentidentifier box 355 which identifies the currently displayed patent, anda down arrow button 165 which, upon selection, displays a menu includingsection headings of the patent currently being displayed. Asillustrated, the Patent Text Toolbox 162 further includes a "Hit" window360 which identifies the occurrence number of the search term. Forexample, in column 3 of the '478 patent illustrated in FIG. 28, thefirst occurrence of the search term would be described in window 360 as"Hit 1 of 25", where the total number of occurrences of the term number25. By activating the right arrow button 351, the next Equivalent Fileis scrolled to the occurrence of the search term in equivalent window160.

In FIG. 28, the sub-heading "Brief Description" is currently selected.By activating button 165, all other headings of the patent (for example,Background of the Invention, Prior Art, etc.) are displayed for theparticular patent as previously described with reference to FIG. 14. Itshould be noted by the reader that the listing of headings displayed bythe activation of button 165 is a listing of the actual headings in thepatent selected. Since patents, like many other documents, are tailoredby the author of the document and identified by unique headings,computer 48 of FIG. 3 generates and displays the actual headings used inthe particular patent which was selected, as opposed to pre-definedheadings which may or may not apply to the specific patent which isbeing viewed.

With reference now to FIG. 29, Patent Text Toolbox 162 includes aplurality of marker icons 381, 382, and 384, which represent markers ofvarying colors for highlighting portions of the text in the displayedEquivalent File, as will be described more fully below. A pen icon isalso provided for identifying portions of patent text. As previouslydescribed, the activation of right arrow button 351 results in computer48 of FIG. 3 scrolling through each sequential instance ("hit") in whichthe search word is encountered after the first instance. By activatingleft arrow button 352, the reverse order will be generated by thecomputer 48 of each instance of the selected search term. As theinstances of the search term are incremented or decremented,respectively, the window 160 is updated (e.g. "hit 5 of 24", "hit 2 of24", etc.). Moreover, it will be noted that in FIG. 28, the Patent TextToolbox 162 also includes a library icon or a case icon. The display ofa library icon indicates that the Equivalent File of the patentdisplayed in the Equivalent window 160 is a library copy. As a librarycopy, the user may not make notes or highlight the Equivalent File. Aworking Equivalent File is denoted by a case icon (similar to icon 106)in place of the library icon (see FIG. 29). As described herein, theworking Equivalent File may be searched, annotated and/or highlighted bythe user.

Further, it will be noted that the current heading identifier (forexample, "Brief Description") illustrated in the box 162 corresponds tothe text displayed in the upper portion of window 160. Thus, in theexample illustrated in FIG. 29, the bibliography begins at column B1.Since the upper portion of the display window 160 displays thebibliography of the patent, the heading identifier within box 160identifies the currently viewed portion of the patent as the"Bibliography".

Referring now to FIG. 30, the Equivalent File of patent '478 includeslocation identifiers for various sections of the Equivalent Filedisplayed. In the example illustrated in FIG. 30, the Bibliographysection of the patent is identified by the letter "B". The letter B(400) appears vertically along the length of the column B1 in the patentequivalent '478. In the presently preferred embodiment, the B identifier400 is in the color blue. Referring to FIG. 30 and FIG. 3, placing thecursor 44 over any one of the letter Bs and clicking switch 46 on mouse42, the corresponding bitmapped image file of the Bibliographic officialUnited States patent document is displayed in an image window 410 ondisplay 68. The linking between the Equivalent File displayed inEquivalent window 160 and the PTO Image File displayed in image window410, was previously described with reference to synchronization in thisspecification.

The present invention provides the user with the ability to viewmultiple versions of the same patent on display 68 in FIG. 3. In FIG.31, the patent Equivalent File displayed within the window 160represents the searchable ASCII text equivalent of the official UnitedStates patent, as previously described in this specification. The patentimage displayed in image window 410 of the screen 68 represents the PTOImage File of the official United States Patent, including the figuresfor the patent. As previously noted, the image patent in window 410 isnot searchable, and is not in an ASCII text format, but may bemanipulated and enlarged within the window 410 as may any bitmappedimage. As shown in FIG. 30, window 410 includes a down arrow buttonfunction 415 that is used to minimize window 410 and a minus buttonfunction 420 (see FIG. 31) that is used to scroll down the window 410 bypage. Also illustrated in FIG. 30 is a Patent Image Toolbox 430, whichmay be selectively positioned anywhere on display 68 by the user. PatentImage toolbox 430 includes a variety of functions which may be employedwhen operating on the patent image in window 410. One of these functionsincludes a rotate image icon 435 which, upon activation, rotates theimage horizontally, then back vertically, if the icon 435 is activatedagain. The Patent Image Toolbox 430 further includes a Go To elementicon 437, which, upon activation, permits a user to type in usingkeyboard 56 of FIG. 3, an element number (from, for example, a drawingdisplayed in image window 410). The computer of the present inventionthen searches the Equivalent File of the patent displayed in Equivalentwindow 160 and displays that portion of the Equivalent File in which thefirst instance of the element number is described. The Patent ImageToolbox 430 also includes various magnification icons, for example, a"1×" icon 440, a "2×" icon 442, and a "3×" icon 445. The activation oficon 442 results in a zoom option, such that medium resolution isachieved and the image displayed within the image window 410 isenlarged. The activation of icon 445 results in a three times zoomoption to permit a user to view images displayed within image window 410with the highest degree of resolution in the presently preferredembodiment.

As illustrated in FIG. 31, the toolboxes of the present invention may beselectively placed at any lobation on the screen 68. To move a toolboxto a different location, cursor 44 of FIG. 3 is placed over a portion ofa top bar area (for example, top bar 450 of box 430 in FIG. 30, or topbar 455 of box 162 in FIG. 31). Referring to FIG. 31 and FIG. 3, oncethe cursor 44 is placed over the top bar area of a toolbox, switch 46 isdepressed, thereby resulting in the effective coupling of the cursor 44to the toolbox. The movement of the mouse 42 results in a correspondingmovement of both the cursor 44 and the toolbox, such that the toolboxmay be placed at any location on screen 68. Cursor 44 may be decoupledfrom the toolbox by simply releasing the switch 46.

Referring once again briefly to FIG. 30, it will be noted that the imagewindow 410 includes a vertical scroll bar 460, and a horizontal scrollbar 470. Using the vertical scroll bar 460, the image displayed withinthe image window 410 may be scrolled vertically. Similarly, the use ofthe horizontal scroll bar 470 permits the horizontal scrolling of theimage illustrated in the image window 410. Moreover, a minus function("-") 472 and a plus ("+") function 474 are provided along the verticalscroll bar 460. Functions 472 and 474 permit a user to navigate throughimages of both the drawings and text (the complete PTO Image File) ofthe patent displayed within the image window 410. Accordingly, it willbe appreciated by one skilled in the art, that a user may view the imageof a patent in window 410, and the text equivalent in equivalent window160, in an unsynchronized manner, or in a synchronized fashion, asdescribed herein. The keyboard 56 in FIG. 3 includes a "up PG" key and a"down PG" key to allow for the unsynchronized viewing of the imageswithin window 410 as well.

Referring now to FIG. 32 and FIG. 3, there is shown the example whereincursor 44 is placed over down arrow button 165 of the control box 162and switch 46 is activated. Computer 48 generates the list 170 whichdisplays the various groupings found within the displayed patent (in thepresent example, patent '027). As shown, patent '027 includes a"Bibliography", an "Abstract", "Background of the Invention", BriefDescription of the Drawings", Detailed Description of the Invention",and "Claims". By placing the cursor 44 over any of the sections listedin the list 170, and clicking switch 46, computer 48 displays theselected text portion of the Equivalent File (for example,"Bibliography" in FIG. 32) within window 160.

Referring to FIG. 33 and FIG. 3, the present invention provides amechanism by which a user may view corresponding patent drawings in thePTO Image File displayed in window 410, by selecting figure numbers inthe Equivalent File within window 160. By placing cursor 44 over afigure number in the Equivalent File displayed in window 160, andmomentarily double-clicking switch 46 on mouse 42, CPU 48 locates anddisplays the image of the corresponding figure within the imagedisplayed in window 410 (see FIG. 33). It will be appreciated that thepresent invention's capability of locating and displaying the selectedfigure within the image window 410 is due to the synchronization of theEquivalent File in window 160 with the PTO Image File displayed withinimage window 410, as previously described in sections above. As will beappreciated, the synchronization of the Equivalent File with the PTOImage File, permits a user to place the cursor 44 over a reference to afigure within the Equivalent File, activate the cursor control device,and thereby view the corresponding figure within the image window 410.

To enlarge a portion of the figure displayed in the image window 410,the user places cursor 44 over the area of the image where theenlargement is desired and depresses switch 46. CPU 48 then generates adynamic outline box 500 (see FIG. 34). By continuously depressing switch46 and selectively moving mouse 42, the user defines an area within theoutline box 500 to be enlarged. The user then releases switch 46, and asshown in FIG. 35, that portion of the image in window 410 disposedwithin the outlined box 500 is enlarged and displayed in the window 410.In the example illustrated in FIGS. 34 and 35, the user has defined anoutline box 500 over a portion of FIG. 3 of the '478 Patent. Theenlarged portion is then displayed as shown in FIG. 35. Additionally, aspreviously described, other controls are provided in the Patent ImageToolbox 430 for rotation of the image, enlargement of the image in setincrements, and to provide additional functions in operating on theimage within the window 410.

Referring to FIG. 35 and FIG. 36, assume for example that a user desiresto locate the first instance in the Equivalent File of patent '478 whichrefers to element 28. In a typical application of the present invention,the user will locate a particular element number of interest illustratedin the figures of the patent (and displayed within image window 410).The activation of icon 437 of the Patent Image Toolbox 430 in FIG. 35results in display of a Select Element Number dialog box 502 in FIG. 37.Using the keyboard 56 of FIG. 3, the user may input the particularelement number of interest, and activate a Find In Text button function504 provided in the Select Element Number dialog box 502. The presentinvention searches and then finds the 5first and the subsequentinstances of the selected element number within the Equivalent File, anddisplays in highlighted form the text in window 160 beginning at thefirst instance ("hit") location in the patent. The subsequent instancesare also highlighted. To view each instance that the element is found,the user proceeds to click the mouse switch 46 while the cursor 44 isplaced on the right arrow button 351 in FIG. 30. When every instance ofthe element has been viewed, a further click on the right arrow button351 in FIG. 30 will indicate in the Patent Text Toolbox 162 that thereare no more hits.

Referring now to FIG. 37 and FIG. 3, the present invention's highlightfunctions will be described. To highlight text within a case copy of theEquivalent File displayed in window 160, a user places cursor 44 over amarker having a desired color. In the presently preferred embodiment,marker icons 381, 382, and 384, are displayed as having different colorsin the Patent Text Toolbox 162. By placing the cursor 44 over one of themarker icons, such as for example marker icon 381, and momentarilyclicking switch 46, the marker color is selected. The movement of cursor44 into the area defined by window 160, results in cursor 44 having thevisual appearance of a marker tip similar to that shown in icon 381. Byplacing the cursor 44 over a desired portion of text within theEquivalent File displayed in window 160, and depressing switch 46, themarker is "turned on". Dragging the cursor 44 over the text to behighlighted, results in that portion of the text having a color tintcorresponding to the color of the marker which has been selected. Themarker may be turned "off" by releasing switch 46 on mouse 42. In FIG.37, a portion of the text has been highlighted and identified as ahighlighted area 540. Due to the limitations of the writtenspecification and the inability to illustrate color, the highlightedarea 540 is simply shown as a black rectangle. Additionally, acorresponding color indicator 542 is displayed next to the highlightedtext 540. As will be described more fully below, placing cursor 44 overcolor indicator 542 and clicking switch 46 results in the patent notewindow, originally connected to this location in the Equivalent File,being displayed on screen 68 in which the user may read or type notesrelated to that portion of the highlighted text. Color indicators havedifferent shapes to aid in identification by users with monochromaticdisplay screens, or those with color blindness.

In the presently preferred embodiment, color indicator 542 furtheridentifies the color by using symbols. For example, in the presentembodiment, if a red marker is selected, color indicator 542 is roundand red. However, if a yellow marker is selected, then the colorindicator 542 is in the shape of a yellow triangle, or if a green markeris selected, color indicator 542 is in the shape of a green rectangle.Additionally, it is possible using the teachings of the presentinvention to overlap marker colors over the same area of text. Byoverriding one-marker with another, multiple patent notes for the sametext may be created. If a given portion of text within window 160 hasbeen highlighted, multiple color indicators 542 will be displayed in ahorizontal row adjacent to one another. Clicking on any of the colorindicators (542) results in the display of the corresponding patentnote.

Referring now to FIG. 38, the present invention's graphic user interfacepermits multiple copies of the same or different patents to be viewedsimultaneously. In the example illustrated in FIG. 38, window 160 isdisplaying a case copy of the Equivalent File of the '027 patent. As acase copy, the user may search edit, and highlight the Equivalent Fileof the '027 patent displayed in the window 160. As shown, an image ofthe '027 Patent, with an enlarged area from its FIG. 3 is displayed inthe window 410 of screen 68 in FIG. 3. Additionally, a libraryEquivalent copy of the '027 patent is also displayed in a third area 555on display screen 68. As a library copy, the Equivalent File displayedin window 555 is not annotatable or highlightable by the user, but maybe searched, and used for purposes of comparison to the figures inwindow 410, or the notes or other-modifications which a user may make inwindow 160 to the case copy of the Equivalent File.

Referring to FIG. 39, the selection of the Import Patents option fromthe Library menu results in the display of an Import Patents dialog box560. The Import Patents dialog box 560 permits the importation ofadditional patent Equivalent Files into a library file. As illustratedin FIG. 40 and FIG. 3, by placing cursor 44 over a selected EquivalentFile to be imported and momentarily clicking switch 46, the selectedEquivalent File is highlighted. Once the selected patent Equivalent Fileis highlighted, cursor 44 is placed over an OK button 562 and switch 46is once again clicked. The Equivalent File is then imported into thecurrent library file.

FIG. 41 and FIG. 3, the placement of cursor 44 over the Case icon 106and the depression of switch 46 on mouse 42 results in the display of aCase menu bar 570. As illustrated, a variety of sub-command items aredisplayed in Case menu 570, including sub-command items identified asNew Case, Open Case, Update Case, Copy Case, Close Case, Search, SetCase Directories, Backup Case, Restore Case, Delete Case, Print, PrintSetup and Exit. If cursor 44 is placed over the Open Case sub-commanditem and switch 46 is clicked, an Open Case dialog box 580 is displayed(see FIG. 42) which includes an area 587 in which all of the casesaccessible are displayed. In the present example, the case "Infringementstudy" is selected by placing cursor 44 over any portion of the words"Infringement study", or on the briefcase icon 590 adjacent to the words"Infringement study". Alternatively, in the event the New Casesub-command item is selected from case menu 570, the computer 48generates and displays a New Case control box 594 (see FIG. 43). Theuser may define, by inserting into the appropriate fields, a new case byidentifying the case name, attorney name, client name and date in whichthe case was opened. Once these fields have been inputted by the user,an OK button 600 is activated by placing cursor 44 over the OK button600 and momentarily clicking switch 46.

Referring to FIG. 43 and FIG. 2, assume for example that the user hasselected the case identified as "Demonstration" following the stepsdescribed with reference to FIG. 42. By placing cursor 44 over the downarrow 109 on the tool bar 103 and depressing switch 46 on mouse 42,computer 48 generates and displays a menu 610 listing all patentscomprising the case "Demonstration". By placing cursor 44 over any oneof the listed patents comprising the case "Demonstration", and clickingswitch 46, the Equivalent File of the selected patent will be displayed,including any previous highlights, patent notes or other edits made by auser in prior sessions.

Referring to FIG. 45, the selection of the sub-command item Update Casefrom the case menu 570 (see FIG. 41) results in the display of an UpdateCase dialog box 615 in which the current case name, attorney name,client and date of opening of the case are displayed. A user may updatethis data by making the necessary modifications within the appropriatefields and activating an OK button function 620. The selection of theSearch sub-command item from menu 570 results in the display of a SearchCase dialog box 635 illustrated in FIG. 46. Using box 635, the user mayperform searches on patents located in case files, or the library, in amanner similar to that described with reference to the search box 302 inFIG. 24.

Referring now to FIG. 47, the selection of the Set Case Directoriessub-command item of menu 570 (see FIG. 41) results in the generation anddisplay of a Set Case Directories dialog box 640. As illustrated, theSet Case Directories dialog box 640 includes a variety of features forthe manipulation, including addition and removal of directories. The SetCase Directories dialog box 640 also permits the user to define casedefault directories where cases will be created. A warning message willappear when a user accesses this command, since modifications of thisfunction will affect the ability to access cases. In operation, a usermay double-click on a directory illustrated within an area 642 of thedialog box 640 to select the directory and open it. The directoriescontained in the selected directory will be listed below the selecteddirectory within the window 642. Similarly, various drive icons areprovided that represent resources to the computer system of the presentinvention. All directories contained on the selected drive are thenlisted in the directories list. As shown in FIG. 47, the selecteddirectory is identified, as is the default directory. Once a user haschosen a directory from which cases will be found, the activation of abutton function Add Directory 644 adds the selected directory to thedirectories containing cases shown in an area 646. A Remove Directoryfunction 650 removes the selected directory from the path list such thatthe directory will no longer be used when finding cases. A Set asDefault function 652 sets the currently set directory as the defaultdirectory. When setting directories for cases, the default directory isthe directory where new cases are created. When setting directories forlibraries, the default directory is where new libraries are created.

Referring to FIG. 48, the selection of the function Copy Case from menu570 (see FIG. 41) results in the display of a Copy-to-Case dialog box700 illustrated in FIG. 48. The Copy-to-Case dialog box 700 permits auser to copy information from an existing case into the current case. Inoperation, a click down menu is selected and displayed throughactivation of a downward arrow button function 702. The click down menuincludes a listing of all cases which a user may choose from to copyinto the currently active case. In the present specification, the clickdown menu is not shown for sake of brevity. The user then has the optionof identifying case notes, patents, or patent notes to copy from theselected case. If the user checks the case note icon 704, all case notesfrom a selected case will be listed in alphabetic order in the list area710. If the patents icon 706 is selected, all patents from the selectedcase will be listed after any case notes in the list area 710. If thepatent note icon 708 is selected, all patent notes from the selectedcase will be listed after each patent in the list area 710. An area 710is provided within the dialog box 700 in which all patents are listed inascending order from the lowest number to the highest number, as well astheir associated case and patent notes. A user may then click on eachitem within the window 710 which is to be copied.

The selection of the Backup Case option from menu 570 (see FIG. 41)results in a Backup Case dialog box 720 being displayed on screen 68 asshown in FIG. 49. A user may utilize the Backup Case dialog box to savea current case onto a backup disk or directory, or onto anothercomputer. A user enters the backup drive and directory within a displaywindow 722 of the dialog box 720 using keyboard 56. Once entered, thebutton function OK 725 is activated and the case is backed up to thedefined location.

Referring to FIG. 50, a case may be deleted by the selection of the 15Delete function in menu 570 (see FIG. 41) which results in the displayof a delete case dialog box 730 as illustrated in FIG. 50. A case may bedeleted using this dialog box including all case notes and all patentnotes.

The selection of the Print command from menu 570 of FIG. 41 results inthe display of a Print dialog box 750, as shown in FIG. 51. The Printdialog box permits the printing of various files, including but notlimited to, patent images, case and patent notes and Equivalent Files ona printer 57 in FIG. 3 or to a file. Selection of a Print Setup commandin menu 570 of FIG. 41 results in the display of a print setup dialogbox 755, as shown in FIG. 52. The Print Setup dialog box 755 permits auser to set up printers, paper orientation, resolution, size and sourceof paper. In addition, default printers may be selected, and if the useris utilizing a network printer, the printer server information is listedin the print setup dialog box along with the printer/default printersetup options.

Referring now to FIG. 53, the selection of the Edit function from menubar 102 results in the display of Edit menu 760. As illustrated, theEdit menu 760 includes sub-command items Undo, Cut, Copy, Paste, Delete,Find and Find Next, Replace, and Go To Column. As illustrated in FIG.54, the selection of the View command option on menu bar 102 results inthe generation and display of a view menu 765. The View menu optionsinclude preferences, screen layout, and status bar. The selection of thePreferences option on menu 765 results in the display of a Preferencesdialog box 770, as shown in FIG. 55. The Preferences dialog box 770permits a user to set preferences or sorting and opening various patentnotes. Also included in the Preferences dialog box 770 is the option toopen a patent window when the note is created. Patent notes can besorted by title, color and location. The activation of the "Open PatentNote Window When Note is Created" results in the patent note windowbeing displayed every time a highlight is made within window 160. A usermay deactivate the feature so that the note window is not displayed whenthe highlights are done. The selection of the function "Sort Patent Noteby Tide" results in the patent notes being sorted in an alphabetical andascending numerical order. If the patent notes are sorted by color andtitle, then the present invention separates the red, green and yellownotes, and arranges them in alphabetical and numerical order. If thepatent notes are sorted by location in the Equivalent File, the presentinvention lists the patent notes as they appear from the beginning ofthe patent Equivalent text displayed in window 160, to the end of thepatent, disregarding alphabetization and color coding.

The selection of the Screen Layout function of menu 765 in FIG. 54results in the display of a Screen Layout dialog box 780 as shown inFIG. 56. Referring to FIGS. 56 and 57, the Screen Layout dialog boxpermits a user to arrange the Equivalent File in window 160 and theimage displayed within the image window 410 on the screen 68 of FIG. 3.In the presently preferred embodiment, four layout options with thechoice of up to four windows to be active simultaneously are provided tothe user. The four screen layout options are denoted by icons to theleft within an area 782. As shown in FIG. 56, within the area 782, fourlayouts are provided in icon form, namely, layout 783, 785, 787, and790. It will, of course, be appreciated by one skilled in the art thatadditional layouts may be provided within the Screen Layout dialog box780, and supported by the present invention. The selection of icon 783results in the display of two equivalent windows simultaneously. Inoperation, the two equivalent windows may be two views of the samepatent or of different Equivalent Files. The selection of icon 785results in the display of one equivalent window (for example window 160)and one image window (for example image window 410), simultaneously. Theselection of icon 787 results in the display of two equivalent windowsand one image window, simultaneously. The selection of icon 790 resultsin the display of two equivalent windows and two image windowssimultaneously. Additional features of the Screen Layout dialog box 780are provided, including downward arrow icon 800 and a downward arrowicon 805, as shown. The selection of the downward arrow icon 800 resultsin the display of a menu listing of patents in the selected case orlibrary, identified as "Patent 1" in the Screen Layout dialog box 780.The selection of downward arrow icon 805 results in the display of amenu listing the patents within the selected case or library, identifiedas "Patent 2". Placing cursor 44 of FIG. 3 over the case icon 807 orcase icon 809 permits a user to select a patent from the current case.Similarly, placing cursor 44 over a library icon 810 or library icon 812and activating the mouse results in the user selecting a patent from thecurrent library.

Referring again to FIG. 56, in the event the icon 785 is selected, thepresent invention displays a text equivalent window 160 on a left-handportion of screen 68, and an image window 410 on the right portion ofscreen 68 of FIG. 3, as illustrated in FIG. 57. The selection of icon783 results in the display of two equivalent windows simultaneously, asshown in FIG. 58. The selection of icon 790 results in the screen layoutillustrated in FIG. 59. As shown in FIG. 60, an equivalent window 160,an image window 410, and a second equivalent window 835 and a secondimage window 850 are simultaneously displayed on screen 68.

Referring now to FIG. 60, selection of the Window option of the menu bar102 results in the display of a Window menu 900. Window menu 900includes sub-command items Cascade, Tile, Arrange Icons, and a listingof all open windows. Selecting a window from this list will bring thewindow to the top.

The selection of the patent note icon downward arrow 127 (see FIG. 11)results in the display of a menu 902 in FIG. 62 listing all patent noteswhich have been sorted according to the specifications in thepreferences dialog box 770 in FIG. 56. As shown, the patent notesinclude various symbol icons in appropriate colors as well as anumerical indicator of the patent note number. As previously discussed,various markers of different colors are provided for highlightingportions of text of the Equivalent File. Referring now to FIG. 62 inconjunction with FIG. 61, placing cursor 44 of FIG. 3 over any of thepatent note selections in menu 902 and clicking the mouse button resultsin the contents of the note being displayed. In the example of FIG. 62,each patent note includes a title 905, and an area 910 in which the usermay input text through keyboard 56 in FIG. 3. Moreover, as illustratedin FIG. 63, a user has the option of opening multiple patent notessimultaneously. As will be appreciated from FIG. 63, each patent noteincludes text, as well as the notation as to the column and line numberof the highlight to which the patent note corresponds. Moreover, ageometric shape indicator 909 (for example, in FIG. 63, a square,triangle, and circle) in the appropriate color corresponds to thehighlight in the Equivalent File within, for example, window 160 in FIG.63. In the example of FIG. 63, the user has selected the multi-note modeand created notes using different colored highlighter pens within theEquivalent File displayed in window 160.

Continuing to refer to FIG. 63, the present invention can copy and pastea portion of, or an entire external file, such as a deposition, aninterrogatory, or an article into the patent note. A user can alsosearch for a term or terms in the patent note. In addition, when theuser clicks on a portion of the text in the patent note or an indicator909, a portion of the text relating to the particular patent note isdisplayed in the equivalent window 160.

FIG. 64 illustrates a case note window entitled "Case Note 2". As shown,Case Note 2 comprises a window 920 having a case note title area 925 andan open text area 930. A user may define the case note title 925, andinput text regarding a particular case directly into area 930 using thekeyboard 56 in FIG. 3. In addition, the user can copy and paste aportion of or a whole file into a case note as well as search for aterm, or terms in the case notes.

FIG. 65 illustrates the minimization of libraries, patents, searchresults, patent and case notes to icons to conserve real estate space onscreen 68 in FIG. 3. In as much as the minimization of documents and thelike are known in the art, no further description of the minimizationutilized by the present invention is provided in this specification.

FIG. 66 illustrates the Go To dialog box 955 of the present inventionwhich permits a user to directly input a column number into a field 960,using keyboard 56 in FIG. 3. The activation of an OK button 965 resultsin the display of the designated column of the Equivalent File in theequivalent window 160. In practice, it has been found that the quickestmethod for locating and displaying a specific column in the EquivalentFile is through the use of the Go To column dialog box 955. FIG. 67illustrates the present invention's Go To Section dialog box 970. Thedialog box 970 includes the downward pointing arrow function 972, theactivation of which results in the display of all sections in theparticular patent displayed within the equivalent window 160. The Go ToSection dialog box 970 is provided as a method for navigating fromsection to section within the Equivalent File displayed in theequivalent window 160. The selection of a section, and the subsequentdisplay of the section, in the equivalent window 160 obviates the needto search page by page for the desired section in the Equivalent File.

Referring now to FIG. 68, the selection of the Help function from menu102 results in the display of a Help menu 980. Help menu 980 includesvarious sub-command options, including Help Index, Getting Started,Learning PatentWorks™, and About. The selection of the About commandresults in the display of an About information box 982, shown in FIG.69. The About dialog box 982 lists information on the inventioncomprising the subject of this application, copyright information andother pertinent information related to the product.

Referring to FIG. 70, the selection of the Note function of menu 102results in the display Note menu 987. The Note menu 987 includes avariety of options including New Case Note, View Case Note, View PatentNote, Find, Find Next, Replace, and Go To Highlighted Text. Theselection of the View Case Note option results in the display of a CaseNotes in Case dialog box 990 as shown in FIG. 71. Box 990 permits a userto select a case note to view. An area 992 in box 990 lists all of thecase notes in the current case. The user may then select a single casenote to view.

The selection of the View Patent Note option of menu 987 of FIG. 70results in the display of a Patent Notes in Case dialog box 994 shown inFIG. 72. Box 994 permits a user to select a patent note to view or todelete. Patent notes are sorted as specified in the preferences dialogbox 770 in FIG. 56, then by the patent note number. A user may selectone of the patent notes by placing the cursor 44 of FIG. 3 over the noteand by clicking the switch on the mouse.

The user interface of the present invention includes a number ofadditional features, as described below.

Flexible Notes

According to one embodiment of the present invention, each patent noteis associated with one patent (a patent note can be associated with oneor more text portions of a patent). Patent notes and subnotes are storedin a note database, such as note database 8702 shown in FIG. 87. Notedatabase 8702 is associated with a single patent, and stores all thenotes and subnotes created by the user for the patent.

Note database 8702 includes a row 8704 for each subnote associated withthe patent. Note database 8702 may also include entries to identifywhich subnotes are associated with which master notes (a master note isa note that contains one or more subnotes). Each row 8704 includes abegin location field 8706, an end location field 8708, a pen color field8710, and a subnote text field 8712. The begin location field 8706 andthe end location field 8708 store information that identify the text inthe patent associated with the subnode. In particular, the beginlocation field 8706 stores information (such as column and line number)where this text in the patent begins, and the end location field 8708stores information (such as column and line number) where this text inthe patent ends. (In an embodiment where a subnote can be linked to morethan one text portion of a patent, there is a begin location value andan end location value for each such text portion.)

The pen color field 8710 stores information that identifies the color ofthe pen which the user utilized to highlight this text of the patent(i.e., the text in the patent with which the patent subnote isassociated). (In an embodiment where a note can be linked to more thanone text portion of a patent, there is a pen color value for each suchtext portion.) The subnote text field 8712 stores the actual text of thepatent subnote.

The present invention retrieves information from the note database 8702as requested by the user and displays such information in a note windowsuch as that shown in FIG. 63. Whenever the user adds, deletes, ormodifies a patent subnode, the present invention reflects this addition,deletion, or modification in the appropriate note database. The userinterface which the present invention provides to allow users tointeract and manipulate patent notes and subnotes is discussed above.The invention enables users to perform keyword searches through any onenote.

According to an alternative embodiment of the present invention, eachpatent note is associated with any or all of the patents in a case. Inparticular, each subnote of a note can be associated with any or all ofthe patents in a case. The user specifies the patents in a caseassociated with each patent subnote.

In this alternative embodiment, patent notes are stored in a notedatabase, such as note database 8802 shown in FIG. 88. Note database8802 is associated with a case, and stores all of the notes and subnotescreated by the user for the patents in the case.

Note database 8802 includes a row 8804 for each subnote associated withthe patents in a case. Each row 8804 includes a patent(s) field 8814, abegin location(s) field 8806, an end location(s) field 8808, a pencolor(s) field 8810, and a note text field 8812. The note database 8802may also include rows to identify which subnotes are associated withwhich master notes.

The patent(s) field 8814 lists all of the patents associated with thepatent subnote. The begin location(s) field 8806 and the end location(s)field 8808 store information that identify the text in the patentsassociated with the patent subnote. There is a begin location value andan end location value for each patent listed in the patent(s) field8814. (In an embodiment where a subnote can be linked to more than onetext portion of a patent, there is a begin location value and an endlocation value for each such text portion in each patent listed in thepatent(s) field 8814.)

The pen color(s) field 8810 stores information that identifies the pencolors which the user utilized to highlight text in the patents listedin the patent(s) field 8814. There is a pen color value for each patentlisted in the patent(s) field 8814. (In an embodiment where a subnotecan be linked to more than one text portion of a patent, there is a pencolor for each such text portion in each patent listed in the patent(s)field 8814.)

The subnote text field 8812 stores the actual text of the patentsubnote.

As just described, a patent subnote may be associated with any of thepatents in a single case. In another embodiment of the invention, apatent subnote may be associated with any patents stored in the system,irrespective of case.

The present invention retrieves information from the note database 8802as requested by the user and displays such information in a note window8902 such as that shown in FIG. 89. In this example note window 8902, afirst patent subnote, a second patent subnote, and a third patentsubnote are being displayed in scroll windows 8904, 8906, and 8908,respectively. The patent(s) associated with these patent subnotes areindicated in display areas 8910, 8912, and 8914, respectively.Specifically, the-first patent subnote is associated with U.S. Pat. No.7,777,777, the second patent subnote is associated with U.S. Pat. Nos.5,555,555 and 6,666,666, and the third patent subnote is associated withU.S. Pat. Nos. 4,444,444; 5,555,555; and 7,777,777. According to anembodiment of the present invention, a user can display a patent (and,in particular, the section of the patent containing the text associatedwith the subnote) by double clicking on the patent number listed indisplay areas 8910, 8912, and 8914.

When a user creates a patent note or subnote (by highlighting a sectionof text in a patent using a color pen, as shown in FIG. 61), the user isallowed to create an entirely new patent note, or to form a linkage toan already existing patent note. The user creates an entirely new patentnote by selecting a section of text (with a pen) while the note window8902 is closed. The user forms a linkage to an already existing patentnote by selecting a section of text (with a pen) while the note window8902 is open. The invention then creates a new subnote in the note beingdisplayed in the note window 8902.

The invention enables users to perform keyword searches through all ofthe notes and subnotes (or any subset of such notes and subnotes)associated with a case. Preferably, the invention performs linearsearches through the notes and subnotes (i.e., word by word textcomparisons), although other well known searching strategies couldalternatively be used.

According to an alternative embodiment of the present invention, apatent note is associated with any or all of the patents in a case.However, each subnote can be associated with only one of the patents ina case. In this embodiment, patent notes are stored in a note database,such as note database 8802 shown in FIG. 88 and described above. In thiscase, however, the patent field 8814 in each row 8804 stores informationidentifying only one patent (again, each row 8804 corresponds to apatent subnote).

Subnotes of a note are displayed in a note window 8902 such as thatshown in FIG. 89 and described above. In this case, however, the displayareas 8910, 8912, and 8914 each identify a single patent.

The embodiments described above are directed to notes and subnoteslinked to portions of patent text. It should be understood, however,that the invention is also directed to notes and subnotes linked tonon-textual and/or non-patent objects. In particular, notes and subnotesmay be linked to patent related and non-patent related documents,images, audio clips, etc.

The subnotes for a note are displayed in a notes window 9010 of adisplay 9002 as shown in FIG. 90. This notes window 9010 is similar tothat shown in FIG. 89. Associated with each subnote 9012, 9014, 9016being displayed is a source and location field 9018, 9020, 9022displaying information that identifies the items and locations in theitems respectively associated with the subnotes 9012, 9014, 9016.

The notes window 9010 is on top of a text window 9004 and an imagewindow 9006 (alternatively, multiple text windows and/or multiple imagewindows could be displayed). The text window 9004 displays therein atext document, such as patent or non-patent related text. The imagewindow 9006 displays an image, which may be patent or non-patentrelated.

A user creates a new subnote in the note displayed in the notes window9010 by using a pen (from a pen tool bar 9008) to select a portion oftext from the text window 9004, or a portion of an image displayed inthe image window 9006. Upon selecting such text or image, the inventioncreates a new subnote in the notes window 9010, this new subnote beingassociated with the selected item. The user can then enter informationpertaining to the selected text/image in the new subnote. (If the noteswindow 9010 was not open at the time that the user selected thetext/image, then a new note would have been created.)

As indicated above, subnotes can also be associated with other types ofitems, such as audio and/or video clips. In this case, informationidentifying such audio and/or video clips is displayed in a window (forexample, a window containing icons each corresponding to an audio clipor a video clip is displayed in a window). A user creates a new note orsubnote by using a pen (from the pen tool bar 9008) to select one ofthese icons.

In this embodiment, information for notes and subnotes are stored in anote database, such as the note database 8802 shown in FIG. 88 anddescribed above. In this case, the patent column 8814 stores informationidentifying the item associated with the respective subnote.

Copy Claims

As described above, during pagination the PTO text file is analyzed toidentify Section headings (see FIG. 10 and the section above entitled"Initial Automatic Pagination"). During such processing, a linked list7902 (shown in FIG. 79) is generated. The actual implementation of thelinked list 7902 is implementation dependent. For example, the linkedlist 7902 may be implemented as a doubly linked list (this is the caseshown in FIG. 79).

The linked list 7902 is synchronized with the corresponding text file7908, and is used to quickly navigate through the text file 7908. Thelinked list 7902 includes a record 7904A-7904N for each section in thetext file 7908 (in the example of FIG. 79, the text file has Nsections). Each of these records 7904A-7904N includes a pointer7906A-7906N that points to the top of the corresponding section in thetext file 7908. For example, record 7904A corresponds to Section 1 inthe text-file 7908. This record 7904A contains a pointer 7906A thatpoints to the top of Section 1 in the text file 7908. The records7904A-7904N may also include other information, such as the sectionnames (represented in FIG. 79 as "1", "2", . . . , "N").

The user interface of the present invention includes a user-selectiblefeature called "Copy Claims to Clipboard". This is an option that isavailable from the menu bar. When the user selects this option, theclaims are automatically copied to the clipboard (or, alternatively, toa user specified file). It should be understood that this feature isalso applicable in a nonpatent context. For example, this feature can beused to automatically copy any user specified section from the text fileto the clipboard (or to a user specified file). However, this feature isparticularly useful in a patent context. For example, this featuregreatly simplifies the generation of claim charts that are oftenincluded as part of validity and infringement opinions.

The operation of the present invention according to this feature isdepicted in a flowchart 8002 in FIG. 80. Flowchart 8002 begins with step8004, where control immediately passes to step 8006.

In step 8006, the last section heading in the text file is automaticallylocated (as will be appreciated, the last section in a patent is theclaims section). This is preferably done by using the linked list 7902.In particular, the last section heading in the text file 7908 is locatedby following the pointer 7906N in the last record 7904N of the linkedlist 7902.

In step 8008, all text in the text file that follows the last sectionheading is automatically extracted from the text file.

In step 8010, this extracted text is copied to the clipboard (or to auser specified file). Operation of flowchart 8002 is complete afterperformance of step 8010, as indicated by step 8012. It should beunderstood that these operations are performed automatically, withoutany input or direction from the user.

Zoom Image

The user interface of the present invention includes a "Zoom Image"feature that may be selected by an operator. The operator can use thisfeature to magnify a selected portion of the image currently beingdisplayed. As discussed above with reference to FIGS. 75, 76A, and 76B,the uncompressed PTO image file is compressed to a 1D (one dimensional)image file. This is done to reduce storage requirements. According tothe present invention, zooming operations involve a transformation fromthe 1D image file directly to the image that is ultimately displayed onthe monitor. This is done to increase the overall speed of the presentinvention.

The manner in which the zoom image operation is performed is generallydepicted in FIG. 81, which shows a 1D compressed image 8102. Forillustrative purposes, this 1D compressed image 8102 is shown as havingfour lines 8104A-8104D. The compressed image 8102 is supplied to a datadecompressor 8106, which is preferably implemented as a processoroperating according to control logic (software). Alternatively, the datadecompressor 8106 is implemented as a hardware state machine. The datadecompressor 8106 decompresses the compressed image 8102 to the degreeindicated by the operator's zoom request. For example, the operator mayhave requested a 75 DPI (dot per inch) or a 150 DPI zoom. Operation ofthe data decompressor 8106 results in a decompressed image 8108, that isdisplayed in the image window 8110.

The operation of the data decompressor 8106 shall now be furtherdescribed with reference to a flowchart 8202 in FIG. 82. Flowchart 8202begins with step 8204, where control immediately passes to step 8206.

In step 8206, the operator selects the zoom image option. The operatordoes so in order to zoom a portion of an image currently beingdisplayed. Preferably, the portion of the image that is to be zoomed isindicated by the current position of the cursor or mouse (i.e., theportion of the image proximate to the current position of the cursor ormouse is the area that is to be zoomed), although other selectionmechanisms could alternatively be employed.

Also in step 8206, the operator indicates the desired zoom level.Preferably, there are three zoom levels: 300 DPI (dots per inch), 150DPI, and 75 DPI. However, the present invention can alternativelysupport additional zoom levels. The operation of the present inventionaccording to such additional zoom levels will be apparent to personsskilled in the relevant art based on the discussion contained herein.

A zoom level denotes both magnification level and resolution (i.e., theamount of data used to represent the final, zoomed image). According tothe present invention, the 300 DPI zoom level is the greatestmagnification level, and is of the highest resolution. The 150 DPI zoomlevel is one half the magnification of the 300 DPI level, and is onehalf the resolution (that is, the 150 DPI level uses one fourth the datato represent the final image as the 300 DPI level). The 75 DPI zoomlevel is one fourth the magnification of the 300 DPI level, and is onefourth the resolution (that is, the 75 DPI level uses one sixteenth thedata to represent the final image as the 300 DPI level).

If the operator selected the 300 DPI zoom level in step 8206, then step8208 is performed. In step 8208, the data decompressor 8106 fullydecompresses that portion of the 1D compressed image corresponding tothe portion of the image currently being displayed that the operatorwishes to zoom (this portion of the image currently being displayed wasselected in step 8206). The phrase "fully decompresses" means that thedecompressor 8106 decompresses every bit in every line of this portionof the 1D compressed image. Procedures for performing this decompressionoperation will be apparent to persons skilled in the relevant art. Instep 8214, the data decompressor 8106 transfers this decompressed datato the image window 8110 for display.

If, instead, the operator selected the 150 DPI zoom level in step 8206,then step 8210 is performed. In step 8210, the data decompressor 8106partially decompresses that portion of the 1D compressed imagecorresponding to the portion of the image currently being displayed thatthe operator wishes to zoom. Specifically, the decompressor 8106decompresses every other bit in every other line of this portion of the1D compressed image. The decompressor 8106 is able to ignore (i.e., notdecompress) all other bits in all other lines since, as discussed above,the 150 DPI level uses one fourth the data to represent the final imageas the 300 DPI level. This results in a significant increase in overallsystem processing speed (since it is necessary to only decompress aportion of the 1D compressed image). In step 8214, the data decompressor8106 transfers this decompressed data to the image window 8110 fordisplay.

If, instead, the operator selected the 75 DPI zoom level in step 8206,then step 8212 is performed. In step 8212, the data decompressor 8106partially decompresses that portion of the 1D compressed imagecorresponding to the portion of the image currently being displayed thatthe operator wishes to zoom. Specifically, the decompressor 8106decompresses every fourth bit in every fourth line of this portion ofthe 1D compressed image. The decompressor 8106 is able to ignore (i.e.,not decompress) all other bits in all other lines since, as discussedabove, the 75 DPI level uses one sixteenth the data to represent thefinal image as the 300 DPI level. Again, this results in a significantincrease in overall system processing speed. In step 8214, the datadecompressor 8106 transfers this decompressed data to the image window8110 for display. The operation of flowchart 8202 is complete after step8214 is performed, as indicated by step 8216.

Copy Image

The user interface of the present invention includes a "Copy Image"feature where, if selected by the operator (preferably via a menuoption), the image currently being displayed is copied to either theclipboard or a user specified file. This option shall now be describedwith reference to a flowchart 8302 in FIG. 83. Flowchart 8302 beginswith step 8304, where control immediately passes to step 8306.

In step 8306, the operator selects the portion of the current image(i.e., the image currently being displayed) that he/she wishes to copy.The operator is permitted to do this in any well known fashion, forexample, by using the mouse.

In step 8308, the operator selects the "Copy Image" option, and selectsthe desired resolution. Currently, there are four resolution levels(although the present invention can alternatively support additionalresolution levels): screen resolution, 300 DPI, 150 DPI, and 75 DPI.Also in step 8308, the operator indicates whether he/she wishes to copyto the clipboard or to a file (which must be specified by the operator).

If the operator selected "screen resolution" in step 8308, then step8310 is performed. In step 8310, the current image as it is currentlybeing displayed is copied to either the clipboard or the user specifiedfile. Preferably, this is accomplished by copying the contents of theimage window (as represented in display memory, for example) to theclipboard or the user specified file.

If, instead, the operator selected "300 DPI resolution" in step 8308,then step 8312 is performed. In step 8312, the portion of the 1Dcompressed image corresponding to the portion of the current image thatthe operator wants to copy (as selected in step 8306) is fullydecompressed. As discussed above, the phrase "fully decompressed" meansthat every bit in every line of this portion of the 1D compressed imageis decompressed. In step 8314, this decompressed data is copied to theclipboard or the user specified file.

If, instead, the operator selected "150 DPI resolution" in step 8308,then step 8318 is performed. In step 8318, the portion of the 1Dcompressed image corresponding to the portion of the current image thatthe operator wants to copy (as selected in step 8306) is partiallydecompressed. Specifically, every other bit in every other line of thisportion of the 1D compressed image is decompressed. Such partialdecompression is possible since, as discussed above, the 150 DPI leveluses one half the data to represent the final image as the 300 DPIlevel. This results in a significant increase in overall systemprocessing speed (since it is necessary to only decompress a portion ofthe 1D compressed image). In step 8314, this decompressed data is copiedto the clipboard or the user specified file.

If, instead, the operator selected "75 DPI resolution" in step 8308,then step 8320 is performed. In step 8320, the portion of the 1Dcompressed image corresponding to the portion of the current image thatthe operator wants to copy (as selected in step 8306) is partiallydecompressed. Specifically, every fourth bit in every fourth line ofthis portion of the 1D compressed image is decompressed. Such partialdecompression is possible since, as discussed above, the 75 DPI leveluses one fourth the data to represent the final image as the 300 DPIlevel. Again, this results in a significant increase in overall systemprocessing speed. In step 8314, this decompressed data is copied to theclipboard or the user specified file. Processing of flowchart 8302 iscomplete after step 8310 or 8314 is performed, as indicated by step8316.

Lock Windows

The user interface of the present invention includes a "lock windows"option that may be selected by an operator (preferably via a menuoption). The operation of the present invention when the "lock windows"option is selected is represented by flowchart 8402 in FIG. 84. When the"lock windows" option is selected, the position of all windows currentlybeing displayed is locked (step 8406). In other words, all movement ofthe windows currently being displayed is disabled, so that it is notpossible for the operator to move the windows.

This option is useful to prevent operators from accidentally moving thewindows. Such inadventant movement of windows is an inconvenience sinceit often results in the scroll bars of such windows being pushed out ofthe display screen, thereby making it difficult for operators tonavigate through the windows.

Conclusion

As described herein, the present invention is implemented to process anddisplay patent text and image files. However, the present invention maybe utilized for any application where there is text data and image datathat must be analyzed or manipulated in a synchronized and paginatedformat fashion. One such application is the processing of magazine orbook electronic data. This data is commonly stored as text data similarto how patent text data is stored as described in the specification.Magazine and similar data would be far more useful if it were paginatedwith images of the actual magazine pages. The text and image data couldbe paginated to produce an Equivalent File that would contain pagenumber, columns numbers within each page, font style information, andposition of the first character in each page, column and line increasingease of navigation and citation.

Users may perform analysis on publications, for example, in twoside-by-side windows, users may perform text searches on the text andthen study the diagrams that go with the text. A tremendous amount oflegacy data exists in magazine and book form that has not been stored inelectronic data form. This data could be used in a similar fashion bysimply replacing the pagination process discussed above with apagination process that uses no existing text data, and instead uses anoptical character reader only to recover the text information from theimages. If there are no image files, then they can easily be produced byscanning the original printed materials. These images could be stored incolor format if color is important in the application.

While the present invention has been described in conjunction with a fewspecific embodiments identified in FIGS. 1 through 86, it will beapparent to those skilled in the art that many alternatives,modifications and variations in light of the foregoing description arepossible. Accordingly, the present invention is intended to embrace allsuch alternatives, modifications and variations as may fall within thespirit and scope of the invention as disclosed. Moreover, due to thelimitations of a written black and white specification and drawings, thereader is referred to the video tape entitled "PatentWorks™", thePatentWorks™ Manual, and the computer program under the same name,submitted with the filing of the parent application on which this patentis based. Since many of the features of the present invention involvedynamic events and the use of color, the viewing of the video tape, anduse of the program, submitted to the United States Patent and TrademarkOffice is advised to thoroughly understand the nature of the presentinvention as disclosed above.

What is claimed is:
 1. A method of manipulating notes having subnoteslinked to portions of documents, comprising the steps of:(1) accessing anote database comprising subnote information for each of said subnotes,said subnote information for said each of said subnotes comprising atleast one location field and at least one subnote content field, said atleast one location field storing location information identifying aportion of a document that is linked to said each of said subnotes, andsaid at least one subnote content field storing informational contentassociated with said each of said subnotes; (2) receiving a user commandto display said each of said subnotes; and (3) displaying said each ofsaid subnotes, comprising the step of:(a) displaying in a subnote windowsaid informational content from said at least one subnote content fieldof said subnote information associated with said each of said subnotes.2. The method of claim 1, wherein step (3) further comprises the stepof:(b) displaying, by reference to said at least one location field ofsaid subnote information associated with said each of said subnotes,information identifying said portion of said document that is linked tosaid each of said subnotes.
 3. The method of claim 1, wherein saidsubnote information for said each of said subnotes also comprises atleast one display attribute field.
 4. The method of claim 3, whereinstep (3) further comprises the step of:displaying at least a part ofsaid each of said subnotes and at least a part of said portion of saiddocument linked to said each of said subnotes using at least one displayattribute specified in said at least one display attribute field.
 5. Themethod of claim 1, wherein said subnote information for said each ofsaid subnotes also comprises at least one source field havinginformation identifying any data objects to which said each of saidsubnotes is linked.
 6. The method of claim 5, wherein said at least onesource field for said each of said subnotes stores information thatindicates that said each of said subnotes is linked to at least portionsof a plurality of data objects.
 7. The method of claim 5, wherein step(3) further comprises the step of:displaying from said at least onesource field associated with said each of said subnotes informationidentifying data objects having at least portions to which said each ofsaid subnotes is linked.
 8. A system of manipulating notes havingsubnotes linked to portions of documents, comprising:note databaseaccessing means for accessing a note database comprising subnoteinformation for each of said subnotes, said subnote information for saideach of said subnotes comprising at least one location field and atleast one subnote content field, said at least one location fieldstoring location information identifying a portion of a document that islinked to said each of said subnotes, and said at least one subnotecontent field storing informational content associated with said each ofsaid subnotes; command receiving means for receiving a user command todisplay said each of said subnotes; and displaying means for displayingsaid each of said subnotes, comprising means for displaying in a subnotewindow said informational content from said at least one subnote contentfield of said subnote information associated with said each of saidsubnotes.
 9. The system of claim 8, wherein said displaying meansfurther comprises:means for displaying, by reference to said at leastone location field of said subnote information associated with said eachof said subnotes, information identifying said portion of said documentthat is linked to said each of said subnotes.
 10. The system of claim 8,wherein said subnote information for said each of said subnotes alsocomprises at least one display attribute field.
 11. The system of claim10, wherein said displaying means further comprises:means for displayingat least a part of said each of said subnotes and at least a part ofsaid portion of said document linked to said each of said subnotes usingat least one display attribute specified in said at least one displayattribute field.
 12. The system of claim 8, wherein said subnoteinformation for said each of said subnotes also comprises at least onesource field having information identifying any data objects to whichsaid each of said subnotes is linked.
 13. The system of claim 12,wherein said at least one source field for said each of said subnotesstores information that indicates that said each of said subnotes islinked to at least portions of a plurality of data objects.
 14. Thesystem of claim 12, wherein said displaying means furthercomprises:means for displaying from said at least one source fieldassociated with said each of said subnotes information identifying dataobjects having at least portions to which said each of said subnotes islinked.
 15. A computer program product having control logic recordedtherein, said control logic, when executed, causing a computer tomanipulate notes having subnotes linked to portions of documents, saidcontrol logic comprising:note database accessing means for enabling thecomputer to access a note database comprising subnote information foreach of said subnotes, said subnote information for said each of saidsubnotes comprising at least one location field and at least one subnotecontent field, said at least one location field storing locationinformation identifying a portion of a document that is linked to saideach of said subnotes, and said at least one subnote content fieldstoring informational content associated with said each of saidsubnotes; command receiving means for enabling the computer to receive auser command to display said each of said subnotes; and displaying meansfor enabling the computer to display said each of said subnotes,comprising means for enabling the computer to display in a subnotewindow said informational content from said at least one subnote contentfield of said subnote information associated with said each of saidsubnotes.
 16. The computer program product of claim 15, wherein saiddisplaying means further comprises:means for enabling the computer todisplay, by reference to said at least one location field of saidsubnote information associated with said each of said subnotes,information identifying said portion of said document that is linked tosaid each of said subnotes.
 17. The computer program product of claim15, wherein said subnote information for said each of said subnotes alsocomprises at least one display attribute field.
 18. The computer programproduct of claim 17, wherein said displaying means furthercomprises:means for enabling the computer to display at least a part ofsaid each of said subnotes and at least a part of said portion of saiddocument linked to said each of said subnotes using at least one displayattribute specified in said at least one display attribute field. 19.The computer program product of claim 15, wherein said subnoteinformation for said each of said subnotes also comprises at least onesource field having information identifying any data objects to whichsaid each of said subnotes is linked.
 20. The computer program productof claim 19, wherein said at least one source field for said each ofsaid subnotes stores information that indicates that said each of saidsubnotes is linked to at least portions of a plurality of data objects.21. The computer program product of claim 19, wherein said displayingmeans further comprises:means for enabling the computer to display fromsaid at least one source field associated with said each of saidsubnotes information identifying data objects having at least portionsto which said each of said subnotes is linked.